Data mining and privacy of social network sites' users

Implications of the data mining problem

Research output: Contribution to journalArticle

9 Citations (Scopus)

Abstract

This paper explores the potential of data mining as a technique that could be used by malicious data miners to threaten the privacy of social network sites (SNS) users. It applies a data mining algorithm to a real dataset to provide empirically-based evidence of the ease with which characteristics about the SNS users can be discovered and used in a way that could invade their privacy. One major contribution of this article is the use of the decision forest data mining algorithm (SysFor) to the context of SNS, which does not only build a decision tree but rather a forest allowing the exploration of more logic rules from a dataset. One logic rule that SysFor built in this study, for example, revealed that anyone having a profile picture showing just the face or a picture showing a family is less likely to be lonely. Another contribution of this article is the discussion of the implications of the data mining problem for governments, businesses, developers and the SNS users themselves.
Original languageEnglish
Pages (from-to)941-966
Number of pages26
JournalScience and Engineering Ethics
Volume21
Issue number4
Early online date2014
DOIs
Publication statusPublished - Aug 2015

Fingerprint

Data privacy
Data Mining
Privacy
Social Support
Data mining
privacy
social network
Decision Trees
Miners
Decision trees
miner
Social networks
Industry
evidence
Datasets
Logic

Cite this

@article{be9f8266c1e14c969e74fc9487f42b15,
title = "Data mining and privacy of social network sites' users: Implications of the data mining problem",
abstract = "This paper explores the potential of data mining as a technique that could be used by malicious data miners to threaten the privacy of social network sites (SNS) users. It applies a data mining algorithm to a real dataset to provide empirically-based evidence of the ease with which characteristics about the SNS users can be discovered and used in a way that could invade their privacy. One major contribution of this article is the use of the decision forest data mining algorithm (SysFor) to the context of SNS, which does not only build a decision tree but rather a forest allowing the exploration of more logic rules from a dataset. One logic rule that SysFor built in this study, for example, revealed that anyone having a profile picture showing just the face or a picture showing a family is less likely to be lonely. Another contribution of this article is the discussion of the implications of the data mining problem for governments, businesses, developers and the SNS users themselves.",
keywords = "Content analysis, Data mining, Logic rules, Privacy, Social network site (SNS)",
author = "Yeslam Al-Saggaf and Islam, {Md Zahidul}",
note = "Includes bibliographical references.",
year = "2015",
month = "8",
doi = "10.1007/s11948-014-9564-6",
language = "English",
volume = "21",
pages = "941--966",
journal = "Science and Engineering Ethics",
issn = "1353-3452",
publisher = "Springer Netherlands",
number = "4",

}

TY - JOUR

T1 - Data mining and privacy of social network sites' users

T2 - Implications of the data mining problem

AU - Al-Saggaf, Yeslam

AU - Islam, Md Zahidul

N1 - Includes bibliographical references.

PY - 2015/8

Y1 - 2015/8

N2 - This paper explores the potential of data mining as a technique that could be used by malicious data miners to threaten the privacy of social network sites (SNS) users. It applies a data mining algorithm to a real dataset to provide empirically-based evidence of the ease with which characteristics about the SNS users can be discovered and used in a way that could invade their privacy. One major contribution of this article is the use of the decision forest data mining algorithm (SysFor) to the context of SNS, which does not only build a decision tree but rather a forest allowing the exploration of more logic rules from a dataset. One logic rule that SysFor built in this study, for example, revealed that anyone having a profile picture showing just the face or a picture showing a family is less likely to be lonely. Another contribution of this article is the discussion of the implications of the data mining problem for governments, businesses, developers and the SNS users themselves.

AB - This paper explores the potential of data mining as a technique that could be used by malicious data miners to threaten the privacy of social network sites (SNS) users. It applies a data mining algorithm to a real dataset to provide empirically-based evidence of the ease with which characteristics about the SNS users can be discovered and used in a way that could invade their privacy. One major contribution of this article is the use of the decision forest data mining algorithm (SysFor) to the context of SNS, which does not only build a decision tree but rather a forest allowing the exploration of more logic rules from a dataset. One logic rule that SysFor built in this study, for example, revealed that anyone having a profile picture showing just the face or a picture showing a family is less likely to be lonely. Another contribution of this article is the discussion of the implications of the data mining problem for governments, businesses, developers and the SNS users themselves.

KW - Content analysis

KW - Data mining

KW - Logic rules

KW - Privacy

KW - Social network site (SNS)

U2 - 10.1007/s11948-014-9564-6

DO - 10.1007/s11948-014-9564-6

M3 - Article

VL - 21

SP - 941

EP - 966

JO - Science and Engineering Ethics

JF - Science and Engineering Ethics

SN - 1353-3452

IS - 4

ER -