Named Entity Recognition System for Postpositional Languages: Urdu as a Case Study

Bibliographic Details
Published in: International journal of advanced computer science & applications, Vol. 7, no. 10
Main Authors: Kamran, Muhammad; Mansoor, Syed
Format: Journal Article
Language: English
Published: West Yorkshire, Science and Information (SAI) Organization Limited, 01.01.2016
ISSN: 2158-107X, 2156-5570
DOI: 10.14569/IJACSA.2016.071019

Abstract: Named Entity Recognition and Classification is the process of identifying named entities and classifying them into classes such as person name, organization name, and location name. In this paper, we propose a tagging scheme, Begin Inside Last -2 (BIL2), for Subject Object Verb (SOV) languages that contain postpositions. We use the Urdu language as a case study. We compare the F-measure values obtained for the tagging schemes IO, BIO2, BILOU, and BIL2 using a Hidden Markov Model (HMM) and a Conditional Random Field (CRF). With the same parameters, including bigram and context window, the BIL2 tagging scheme gives better results than the other three tagging schemes. With HMM, the F-measure values for IO, BIO2, BILOU, and BIL2 are 44.87%, 44.88%, 45.14%, and 45.88%, respectively. With CRF, the F-measure values for IO, BIO2, BILOU, and BIL2 are 35.13%, 35.90%, 37.85%, and 38.39%, respectively. The F-measure values for BIL2 are better than those of previously reported techniques.
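
For readers unfamiliar with the tagging schemes compared in the abstract, the following is a minimal sketch of how the same entity spans are encoded under IO, BIO2, and BILOU, using the conventional definitions of those three schemes. The abstract does not define BIL2, so it is deliberately not implemented here. The function names, the English example tokens, and the PER/LOC labels are illustrative assumptions and are not taken from the paper. A small helper for the F-measure (harmonic mean of precision and recall) is included for completeness.

```python
from typing import List, Tuple

# An entity span: (start index, end index exclusive, entity type).
Span = Tuple[int, int, str]


def encode(n_tokens: int, spans: List[Span], scheme: str) -> List[str]:
    """Turn entity spans into per-token tags under IO, BIO2, or BILOU."""
    if scheme not in {"IO", "BIO2", "BILOU"}:
        # BIL2 is not defined in the abstract, so it is not supported here.
        raise ValueError(f"unsupported scheme: {scheme}")
    tags = ["O"] * n_tokens  # every token starts outside any entity
    for start, end, etype in spans:
        for i in range(start, end):
            if scheme == "IO":
                prefix = "I"  # IO marks only inside vs. outside
            elif scheme == "BIO2":
                prefix = "B" if i == start else "I"
            else:  # BILOU
                if end - start == 1:
                    prefix = "U"  # single-token (unit) entity
                elif i == start:
                    prefix = "B"
                elif i == end - 1:
                    prefix = "L"  # last token of a multi-token entity
                else:
                    prefix = "I"
            tags[i] = f"{prefix}-{etype}"
    return tags


def f_measure(tp: int, fp: int, fn: int) -> float:
    """Balanced F-measure from true positive, false positive, false negative counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical example sentence: a two-token person name and a one-token location.
tokens = ["Muhammad", "Kamran", "visited", "Lahore", "yesterday"]
spans = [(0, 2, "PER"), (3, 4, "LOC")]
for scheme in ("IO", "BIO2", "BILOU"):
    print(scheme, encode(len(tokens), spans, scheme))
```

The schemes differ only in how much boundary information each tag carries: IO cannot separate two adjacent entities of the same type, BIO2 marks where an entity begins, and BILOU additionally marks where it ends and whether it is a single token.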
Copyright 2016. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Subject Terms: Case studies
URI https://www.proquest.com/docview/2656507905
http://thesai.org/Downloads/Volume7No10/Paper_19-Named_Entity_Recognition_System_for_Postpositional_Languages.pdf