FamilyID: A Hybrid Approach to Identify Family Information from Microblogs

With the growing popularity of social networks, extremely large amount of users routinely post messages about their daily life to online social networking services. In particular, we have observed that family related information, including some very sensitive information, are freely available and ea...

Full description

Saved in:
Bibliographic Details
Published inData and Applications Security and Privacy XXIX Vol. 9149; pp. 215 - 222
Main Authors Gopal, Jamuna, Huang, Shu, Luo, Bo
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2015
Springer International Publishing
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN3319208098
9783319208091
ISSN0302-9743
1611-3349
1611-3349
DOI10.1007/978-3-319-20810-7_14

Cover

Abstract With the growing popularity of social networks, extremely large amount of users routinely post messages about their daily life to online social networking services. In particular, we have observed that family related information, including some very sensitive information, are freely available and easily extracted from Twitter. In this paper, we present a hybrid information retrieval mechanism, namely FamilyID, to identify and extract family related information of a user from his/her microblogs (tweets). The proposed model takes into account part-of-speech tagging, pattern matching, lexical similarity, and semantic similarity of the tweets. Experiment results show that FamilyID provides both high precision and recall. We expect the project to serve as a warning to users that they may have accidentally revealed too much personal/family information to the public. It could also help microblog users to evaluate the amount of information that they have already revealed.
AbstractList With the growing popularity of social networks, extremely large amount of users routinely post messages about their daily life to online social networking services. In particular, we have observed that family related information, including some very sensitive information, are freely available and easily extracted from Twitter. In this paper, we present a hybrid information retrieval mechanism, namely FamilyID, to identify and extract family related information of a user from his/her microblogs (tweets). The proposed model takes into account part-of-speech tagging, pattern matching, lexical similarity, and semantic similarity of the tweets. Experiment results show that FamilyID provides both high precision and recall. We expect the project to serve as a warning to users that they may have accidentally revealed too much personal/family information to the public. It could also help microblog users to evaluate the amount of information that they have already revealed.
Author Huang, Shu
Luo, Bo
Gopal, Jamuna
Author_xml – sequence: 1
  givenname: Jamuna
  surname: Gopal
  fullname: Gopal, Jamuna
– sequence: 2
  givenname: Shu
  surname: Huang
  fullname: Huang, Shu
  email: shuang@microsoft.com
– sequence: 3
  givenname: Bo
  surname: Luo
  fullname: Luo, Bo
  email: bluo@ku.edu
BookMark eNqNkc9u1DAQhw0U1G3pG3DwC7h4_N_cVi1tFxVxgbPlxHY3bTYOTlYob19vF_UKpxnN_L45fHOGToY8RIQ-Ab0ESvVnqw3hhIMljBqgRDsQb9AZr5OXAbxFK1AAhHNh370uqDUnaEU5ZcRqwT-glQVmFFeSnqKLaXqklIKURhi-Qt9u_K7rl831F7zGd0tTuoDX41iyb7d4zngT4jB3acHHHN4MKZedn7s84FTyDn_v2pKbPj9MH9H75PspXvyt5-jXzdefV3fk_sft5mp9T7ZcmZmAF8JHRVNoLMgUomKRRx1kSkJCgBC0ibqNQnrVMu7BytpLbRoaZGgSP0fyeHc_jH754_vejaXb-bI4oO5gzlVzjruqw72IcgdzlWNHbqrx4SEW1-T8NP0LEkeoKvm9j9Ps4oFqq5Xi-3brxzmWySlmNLfSgVWOMfq_WP2CpkK9Ys_yDJAl
ContentType Book Chapter
Copyright IFIP International Federation for Information Processing 2015
Copyright_xml – notice: IFIP International Federation for Information Processing 2015
DBID FFUUA
ABOKW
UNPAY
DEWEY 005.8
DOI 10.1007/978-3-319-20810-7_14
DatabaseName ProQuest Ebook Central - Book Chapters - Demo use only
Unpaywall for CDI: Monographs and Miscellaneous Content
Unpaywall
DatabaseTitleList
Database_xml – sequence: 1
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 3319208101
9783319208107
EISSN 1611-3349
Editor Samarati, Pierangela
Editor_xml – sequence: 1
  fullname: Samarati, Pierangela
EndPage 222
ExternalDocumentID oai:HAL:hal-01745822v1
EBC6287395_196_220
EBC5587046_196_220
GroupedDBID 0D6
0DA
38.
AABBV
AAGZE
AAZAK
AAZUS
ABBVZ
ABFTD
ABMNI
ACKNT
ACRRC
AEDXK
AEJLV
AEKFX
AETDV
AEZAY
ALMA_UNASSIGNED_HOLDINGS
APFYR
AZZ
BBABE
CZZ
FFUUA
I4C
IEZ
IY-
LDH
SBO
SFQCF
TMQGW
TPJZQ
TSXQS
TWXRB
Z7R
Z7S
Z7U
Z7X
Z7Y
Z7Z
Z81
Z83
Z84
Z85
Z88
-DT
-GH
-~X
1SB
29L
2HA
2HV
5QI
875
AASHB
ACGFS
ADCXD
AEFIE
EJD
F5P
FEDTE
HVGLF
LAS
P2P
RNI
RSU
SVGTG
VI1
~02
ABOKW
UNPAY
ID FETCH-LOGICAL-h368t-1a44ae60fdb915fde62e3e7d5ff451d1dd78e7ce45a6c23a195e45578b0d5dbf3
IEDL.DBID UNPAY
ISBN 3319208098
9783319208091
ISSN 0302-9743
1611-3349
IngestDate Sun Oct 26 04:10:29 EDT 2025
Wed Sep 17 04:00:57 EDT 2025
Thu May 29 01:00:16 EDT 2025
Wed May 28 23:42:57 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
LCCallNum QA76.9.A25QA76.9.D3Q
Language English
License cc-by
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-h368t-1a44ae60fdb915fde62e3e7d5ff451d1dd78e7ce45a6c23a195e45578b0d5dbf3
Notes S. Huang and B. Luo—This work was partially supported by NSF CNS-1422206, NSF IIS-1513324, NSF OIA-1308762, and University of Kansas GRF-2301876.
OCLC 912863650
OpenAccessLink https://proxy.k.utb.cz/login?url=https://inria.hal.science/hal-01745822
PQID EBC5587046_196_220
PageCount 8
ParticipantIDs unpaywall_primary_10_1007_978_3_319_20810_7_14
springer_books_10_1007_978_3_319_20810_7_14
proquest_ebookcentralchapters_6287395_196_220
proquest_ebookcentralchapters_5587046_196_220
PublicationCentury 2000
PublicationDate 2015
PublicationDateYYYYMMDD 2015-01-01
PublicationDate_xml – year: 2015
  text: 2015
PublicationDecade 2010
PublicationPlace Switzerland
PublicationPlace_xml – name: Switzerland
– name: Cham
PublicationSeriesSubtitle Information Systems and Applications, incl. Internet/Web, and HCI
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSeriesTitleAlternate Lect.Notes Computer
PublicationSubtitle 29th Annual IFIP WG 11. 3 Working Conference, DBSec 2015, Fairfax, VA, USA, July 13-15, 2015, Proceedings
PublicationTitle Data and Applications Security and Privacy XXIX
PublicationYear 2015
Publisher Springer International Publishing AG
Springer International Publishing
Publisher_xml – name: Springer International Publishing AG
– name: Springer International Publishing
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Steffen, Bernhard
Pandu Rangan, C.
Kanade, Takeo
Kittler, Josef
Weikum, Gerhard
Hutchison, David
Tygar, Doug
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
– sequence: 8
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
– sequence: 9
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
– sequence: 10
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
– sequence: 11
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
– sequence: 12
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
SSID ssj0001558483
ssj0002792
Score 1.7632092
Snippet With the growing popularity of social networks, extremely large amount of users routinely post messages about their daily life to online social networking...
SourceID unpaywall
springer
proquest
SourceType Open Access Repository
Publisher
StartPage 215
SubjectTerms Candidate Tweet
Computer security
Information architecture
Lexical Similarity
Microblog
Online Social Network Data
Semantic Similarity Assessment
Title FamilyID: A Hybrid Approach to Identify Family Information from Microblogs
URI http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=5587046&ppg=220
http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6287395&ppg=220
http://link.springer.com/10.1007/978-3-319-20810-7_14
https://inria.hal.science/hal-01745822
UnpaywallVersion submittedVersion
Volume 9149
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEA7SHkQPvlFRycGbpO5uHrvxVrRai4oHK3oKySaLoKxit0j99U662Vo8KN7CMoEw32Zmwsw3g9ChZDy2TFpCc0YJWMmMZCKbpgtpTPM0ZtyTk69vRH_IBg_8IRTITrkwJei98wRxZ_AAx7CG927q8ztgatuCQ8zdQu3hzW33sU4RJESGSnrhe_JRJuc4cnX631N1EvB-EUmVJ-zMRZOzBOgSWhyXb3ryoV9e5nzM-Qq6aE5Xl5Y8d8aV6eSfPxo3_n38VbTsGQzYUwtAa2towZXraKWZ34DDdd5Ag3rqxeXZCe7i_sRzt3A39BjH1SuuSbzFBNdyOFCXPJTY01Lwta_mM3C-0SYanvfuTvskzFYgT1RkFYk1Y9qJqLBGxrywTiSOutTyovDoxdammUtzx7gWeUJ1LDms4XqbyHJrCrqFWuVr6bYRjlycGfjuRG6YFlIbxqxIpHNcAyLJDiKNttU0AxzKTvNaCyPFORgNJhQYA5Uk0Z_yAl55VPJv-aMGQuXFR6ppxQzYK6oAezXFXnnsd1BnhrJ6qzt6_Lph978b9lCreh-7fQhWKnOA2t3e4Or-IPyxX0Df5SI
linkProvider Unpaywall
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEA5SD6IH36io5OBNUnc3j914K76q0OLBQj2FZJOlYFnFbpH66510s7V4ULyFZQJhvs3MhJlvBqEzyXhsmbSE5owSsJIZyUQ2TxfSmOZpzLgnJ_f6ojtgD0M-DAWycy5MCXpvjyDuDB7gAtbw3k19fgdM7argEHO30Oqg_9h5rlMECZGhkl74nnyUySWOXJ3-91SdBLxfRFLlCTtL0eQiAbqO1qblm5596PF4ycfcbqK75nR1aclLe1qZdv75o3Hj38ffQhuewYA9tQC0to1WXLmDNpv5DThc5130UE-9uL--xB3cnXnuFu6EHuO4esU1ibeY4VoOB-qShxJ7Wgru-Wo-A-eb7KHB7c3TVZeE2QpkREVWkVgzpp2ICmtkzAvrROKoSy0vCo9ebG2auTR3jGuRJ1THksMarreJLLemoPuoVb6W7gDhyMWZge9O5IZpIbVhzIpEOsc1IJIcItJoW80zwKHsNK-1MFGcg9FgQoExUEkS_Skv4JVHJf-WP28gVF58oppWzIC9ogqwV3Pslcf-ELUXKKu3uqPHrxuO_rvhGLWq96k7gWClMqfhT_0CH7PjjQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Data+and+Applications+Security+and+Privacy+XXIX&rft.atitle=FamilyID%3A+A+Hybrid+Approach+to+Identify+Family+Information+from+Microblogs&rft.date=2015-01-01&rft.pub=Springer+International+Publishing+AG&rft.isbn=9783319208091&rft.volume=9149&rft_id=info:doi/10.1007%2F978-3-319-20810-7_14&rft.externalDBID=220&rft.externalDocID=EBC6287395_196_220
thumbnail_s http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F5587046-l.jpg
http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6287395-l.jpg