FamilyID: A Hybrid Approach to Identify Family Information from Microblogs
With the growing popularity of social networks, extremely large amount of users routinely post messages about their daily life to online social networking services. In particular, we have observed that family related information, including some very sensitive information, are freely available and ea...
        Saved in:
      
    
          | Published in | Data and Applications Security and Privacy XXIX Vol. 9149; pp. 215 - 222 | 
|---|---|
| Main Authors | , , | 
| Format | Book Chapter | 
| Language | English | 
| Published | 
        Switzerland
          Springer International Publishing AG
    
        2015
     Springer International Publishing  | 
| Series | Lecture Notes in Computer Science | 
| Subjects | |
| Online Access | Get full text | 
| ISBN | 3319208098 9783319208091  | 
| ISSN | 0302-9743 1611-3349 1611-3349  | 
| DOI | 10.1007/978-3-319-20810-7_14 | 
Cover
| Abstract | With the growing popularity of social networks, extremely large amount of users routinely post messages about their daily life to online social networking services. In particular, we have observed that family related information, including some very sensitive information, are freely available and easily extracted from Twitter. In this paper, we present a hybrid information retrieval mechanism, namely FamilyID, to identify and extract family related information of a user from his/her microblogs (tweets). The proposed model takes into account part-of-speech tagging, pattern matching, lexical similarity, and semantic similarity of the tweets. Experiment results show that FamilyID provides both high precision and recall. We expect the project to serve as a warning to users that they may have accidentally revealed too much personal/family information to the public. It could also help microblog users to evaluate the amount of information that they have already revealed. | 
    
|---|---|
| AbstractList | With the growing popularity of social networks, extremely large amount of users routinely post messages about their daily life to online social networking services. In particular, we have observed that family related information, including some very sensitive information, are freely available and easily extracted from Twitter. In this paper, we present a hybrid information retrieval mechanism, namely FamilyID, to identify and extract family related information of a user from his/her microblogs (tweets). The proposed model takes into account part-of-speech tagging, pattern matching, lexical similarity, and semantic similarity of the tweets. Experiment results show that FamilyID provides both high precision and recall. We expect the project to serve as a warning to users that they may have accidentally revealed too much personal/family information to the public. It could also help microblog users to evaluate the amount of information that they have already revealed. | 
    
| Author | Huang, Shu Luo, Bo Gopal, Jamuna  | 
    
| Author_xml | – sequence: 1 givenname: Jamuna surname: Gopal fullname: Gopal, Jamuna – sequence: 2 givenname: Shu surname: Huang fullname: Huang, Shu email: shuang@microsoft.com – sequence: 3 givenname: Bo surname: Luo fullname: Luo, Bo email: bluo@ku.edu  | 
    
| BookMark | eNqNkc9u1DAQhw0U1G3pG3DwC7h4_N_cVi1tFxVxgbPlxHY3bTYOTlYob19vF_UKpxnN_L45fHOGToY8RIQ-Ab0ESvVnqw3hhIMljBqgRDsQb9AZr5OXAbxFK1AAhHNh370uqDUnaEU5ZcRqwT-glQVmFFeSnqKLaXqklIKURhi-Qt9u_K7rl831F7zGd0tTuoDX41iyb7d4zngT4jB3acHHHN4MKZedn7s84FTyDn_v2pKbPj9MH9H75PspXvyt5-jXzdefV3fk_sft5mp9T7ZcmZmAF8JHRVNoLMgUomKRRx1kSkJCgBC0ibqNQnrVMu7BytpLbRoaZGgSP0fyeHc_jH754_vejaXb-bI4oO5gzlVzjruqw72IcgdzlWNHbqrx4SEW1-T8NP0LEkeoKvm9j9Ps4oFqq5Xi-3brxzmWySlmNLfSgVWOMfq_WP2CpkK9Ys_yDJAl | 
    
| ContentType | Book Chapter | 
    
| Copyright | IFIP International Federation for Information Processing 2015 | 
    
| Copyright_xml | – notice: IFIP International Federation for Information Processing 2015 | 
    
| DBID | FFUUA ABOKW UNPAY  | 
    
| DEWEY | 005.8 | 
    
| DOI | 10.1007/978-3-319-20810-7_14 | 
    
| DatabaseName | ProQuest Ebook Central - Book Chapters - Demo use only Unpaywall for CDI: Monographs and Miscellaneous Content Unpaywall  | 
    
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Computer Science | 
    
| EISBN | 3319208101 9783319208107  | 
    
| EISSN | 1611-3349 | 
    
| Editor | Samarati, Pierangela | 
    
| Editor_xml | – sequence: 1 fullname: Samarati, Pierangela  | 
    
| EndPage | 222 | 
    
| ExternalDocumentID | oai:HAL:hal-01745822v1 EBC6287395_196_220 EBC5587046_196_220  | 
    
| GroupedDBID | 0D6 0DA 38. AABBV AAGZE AAZAK AAZUS ABBVZ ABFTD ABMNI ACKNT ACRRC AEDXK AEJLV AEKFX AETDV AEZAY ALMA_UNASSIGNED_HOLDINGS APFYR AZZ BBABE CZZ FFUUA I4C IEZ IY- LDH SBO SFQCF TMQGW TPJZQ TSXQS TWXRB Z7R Z7S Z7U Z7X Z7Y Z7Z Z81 Z83 Z84 Z85 Z88 -DT -GH -~X 1SB 29L 2HA 2HV 5QI 875 AASHB ACGFS ADCXD AEFIE EJD F5P FEDTE HVGLF LAS P2P RNI RSU SVGTG VI1 ~02 ABOKW UNPAY  | 
    
| ID | FETCH-LOGICAL-h368t-1a44ae60fdb915fde62e3e7d5ff451d1dd78e7ce45a6c23a195e45578b0d5dbf3 | 
    
| IEDL.DBID | UNPAY | 
    
| ISBN | 3319208098 9783319208091  | 
    
| ISSN | 0302-9743 1611-3349  | 
    
| IngestDate | Sun Oct 26 04:10:29 EDT 2025 Wed Sep 17 04:00:57 EDT 2025 Thu May 29 01:00:16 EDT 2025 Wed May 28 23:42:57 EDT 2025  | 
    
| IsDoiOpenAccess | true | 
    
| IsOpenAccess | true | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| LCCallNum | QA76.9.A25QA76.9.D3Q | 
    
| Language | English | 
    
| License | cc-by | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-h368t-1a44ae60fdb915fde62e3e7d5ff451d1dd78e7ce45a6c23a195e45578b0d5dbf3 | 
    
| Notes | S. Huang and B. Luo—This work was partially supported by NSF CNS-1422206, NSF IIS-1513324, NSF OIA-1308762, and University of Kansas GRF-2301876. | 
    
| OCLC | 912863650 | 
    
| OpenAccessLink | https://proxy.k.utb.cz/login?url=https://inria.hal.science/hal-01745822 | 
    
| PQID | EBC5587046_196_220 | 
    
| PageCount | 8 | 
    
| ParticipantIDs | unpaywall_primary_10_1007_978_3_319_20810_7_14 springer_books_10_1007_978_3_319_20810_7_14 proquest_ebookcentralchapters_6287395_196_220 proquest_ebookcentralchapters_5587046_196_220  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2015 | 
    
| PublicationDateYYYYMMDD | 2015-01-01 | 
    
| PublicationDate_xml | – year: 2015 text: 2015  | 
    
| PublicationDecade | 2010 | 
    
| PublicationPlace | Switzerland | 
    
| PublicationPlace_xml | – name: Switzerland – name: Cham  | 
    
| PublicationSeriesSubtitle | Information Systems and Applications, incl. Internet/Web, and HCI | 
    
| PublicationSeriesTitle | Lecture Notes in Computer Science | 
    
| PublicationSeriesTitleAlternate | Lect.Notes Computer | 
    
| PublicationSubtitle | 29th Annual IFIP WG 11. 3 Working Conference, DBSec 2015, Fairfax, VA, USA, July 13-15, 2015, Proceedings | 
    
| PublicationTitle | Data and Applications Security and Privacy XXIX | 
    
| PublicationYear | 2015 | 
    
| Publisher | Springer International Publishing AG Springer International Publishing  | 
    
| Publisher_xml | – name: Springer International Publishing AG – name: Springer International Publishing  | 
    
| RelatedPersons | Kleinberg, Jon M. Mattern, Friedemann Naor, Moni Mitchell, John C. Terzopoulos, Demetri Steffen, Bernhard Pandu Rangan, C. Kanade, Takeo Kittler, Josef Weikum, Gerhard Hutchison, David Tygar, Doug  | 
    
| RelatedPersons_xml | – sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni – sequence: 8 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. – sequence: 9 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard – sequence: 10 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri – sequence: 11 givenname: Doug surname: Tygar fullname: Tygar, Doug – sequence: 12 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard  | 
    
| SSID | ssj0001558483 ssj0002792  | 
    
| Score | 1.7632092 | 
    
| Snippet | With the growing popularity of social networks, extremely large amount of users routinely post messages about their daily life to online social networking... | 
    
| SourceID | unpaywall springer proquest  | 
    
| SourceType | Open Access Repository Publisher  | 
    
| StartPage | 215 | 
    
| SubjectTerms | Candidate Tweet Computer security Information architecture Lexical Similarity Microblog Online Social Network Data Semantic Similarity Assessment  | 
    
| Title | FamilyID: A Hybrid Approach to Identify Family Information from Microblogs | 
    
| URI | http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=5587046&ppg=220 http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6287395&ppg=220 http://link.springer.com/10.1007/978-3-319-20810-7_14 https://inria.hal.science/hal-01745822  | 
    
| UnpaywallVersion | submittedVersion | 
    
| Volume | 9149 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEA7SHkQPvlFRycGbpO5uHrvxVrRai4oHK3oKySaLoKxit0j99U662Vo8KN7CMoEw32Zmwsw3g9ChZDy2TFpCc0YJWMmMZCKbpgtpTPM0ZtyTk69vRH_IBg_8IRTITrkwJei98wRxZ_AAx7CG927q8ztgatuCQ8zdQu3hzW33sU4RJESGSnrhe_JRJuc4cnX631N1EvB-EUmVJ-zMRZOzBOgSWhyXb3ryoV9e5nzM-Qq6aE5Xl5Y8d8aV6eSfPxo3_n38VbTsGQzYUwtAa2towZXraKWZ34DDdd5Ag3rqxeXZCe7i_sRzt3A39BjH1SuuSbzFBNdyOFCXPJTY01Lwta_mM3C-0SYanvfuTvskzFYgT1RkFYk1Y9qJqLBGxrywTiSOutTyovDoxdammUtzx7gWeUJ1LDms4XqbyHJrCrqFWuVr6bYRjlycGfjuRG6YFlIbxqxIpHNcAyLJDiKNttU0AxzKTvNaCyPFORgNJhQYA5Uk0Z_yAl55VPJv-aMGQuXFR6ppxQzYK6oAezXFXnnsd1BnhrJ6qzt6_Lph978b9lCreh-7fQhWKnOA2t3e4Or-IPyxX0Df5SI | 
    
| linkProvider | Unpaywall | 
    
| linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEA5SD6IH36io5OBNUnc3j914K76q0OLBQj2FZJOlYFnFbpH66510s7V4ULyFZQJhvs3MhJlvBqEzyXhsmbSE5owSsJIZyUQ2TxfSmOZpzLgnJ_f6ojtgD0M-DAWycy5MCXpvjyDuDB7gAtbw3k19fgdM7argEHO30Oqg_9h5rlMECZGhkl74nnyUySWOXJ3-91SdBLxfRFLlCTtL0eQiAbqO1qblm5596PF4ycfcbqK75nR1aclLe1qZdv75o3Hj38ffQhuewYA9tQC0to1WXLmDNpv5DThc5130UE-9uL--xB3cnXnuFu6EHuO4esU1ibeY4VoOB-qShxJ7Wgru-Wo-A-eb7KHB7c3TVZeE2QpkREVWkVgzpp2ICmtkzAvrROKoSy0vCo9ebG2auTR3jGuRJ1THksMarreJLLemoPuoVb6W7gDhyMWZge9O5IZpIbVhzIpEOsc1IJIcItJoW80zwKHsNK-1MFGcg9FgQoExUEkS_Skv4JVHJf-WP28gVF58oppWzIC9ogqwV3Pslcf-ELUXKKu3uqPHrxuO_rvhGLWq96k7gWClMqfhT_0CH7PjjQ | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Data+and+Applications+Security+and+Privacy+XXIX&rft.atitle=FamilyID%3A+A+Hybrid+Approach+to+Identify+Family+Information+from+Microblogs&rft.date=2015-01-01&rft.pub=Springer+International+Publishing+AG&rft.isbn=9783319208091&rft.volume=9149&rft_id=info:doi/10.1007%2F978-3-319-20810-7_14&rft.externalDBID=220&rft.externalDocID=EBC6287395_196_220 | 
    
| thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F5587046-l.jpg http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6287395-l.jpg  |