Named Entity Recognition System for Postpositional Languages: Urdu as a Case Study
Named Entity Recognition and Classification is the process of identifying named entities and classifying them into one of the classes like person name, organization name, location name, etc. In this paper, we propose a tagging scheme Begin Inside Last -2 (BIL2) for the Subject Object Verb (SOV) lang...
Saved in:
Published in | International journal of advanced computer science & applications Vol. 7; no. 10 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
West Yorkshire
Science and Information (SAI) Organization Limited
01.01.2016
|
Subjects | |
Online Access | Get full text |
ISSN | 2158-107X 2156-5570 2156-5570 |
DOI | 10.14569/IJACSA.2016.071019 |
Cover
Abstract | Named Entity Recognition and Classification is the process of identifying named entities and classifying them into one of the classes like person name, organization name, location name, etc. In this paper, we propose a tagging scheme Begin Inside Last -2 (BIL2) for the Subject Object Verb (SOV) languages that contain postposition. We use the Urdu language as a case study. We compare the F-measure values obtained for the tagging schemes IO, BIO2, BILOU and BIL2 using Hidden Markov Model (HMM) and Conditional Random Field (CRF). The BIL2 tagging scheme results are better than the other three tagging schemes using the same parameters including bigram and context window. With HMM, the F-measure values for IO, BIO2, BILOU, and BIL2 are 44.87%, 44.88%, 45.14%, and 45.88%, respectively. With CRF, the F-measure values for IO, BIO2, BILOU, and BIL2 are 35.13%, 35.90%, 37.85%, and 38.39%, respectively. The F-measure values for BIL2 are better than those of previously reported techniques |
---|---|
AbstractList | Named Entity Recognition and Classification is the process of identifying named entities and classifying them into one of the classes like person name, organization name, location name, etc. In this paper, we propose a tagging scheme Begin Inside Last -2 (BIL2) for the Subject Object Verb (SOV) languages that contain postposition. We use the Urdu language as a case study. We compare the F-measure values obtained for the tagging schemes IO, BIO2, BILOU and BIL2 using Hidden Markov Model (HMM) and Conditional Random Field (CRF). The BIL2 tagging scheme results are better than the other three tagging schemes using the same parameters including bigram and context window. With HMM, the F-measure values for IO, BIO2, BILOU, and BIL2 are 44.87%, 44.88%, 45.14%, and 45.88%, respectively. With CRF, the F-measure values for IO, BIO2, BILOU, and BIL2 are 35.13%, 35.90%, 37.85%, and 38.39%, respectively. The F-measure values for BIL2 are better than those of previously reported techniques |
Author | Mansoor, Syed Kamran, Muhammad |
Author_xml | – sequence: 1 givenname: Muhammad surname: Kamran fullname: Kamran, Muhammad – sequence: 2 givenname: Syed surname: Mansoor fullname: Mansoor, Syed |
BookMark | eNp9kFFLwzAUhYNMcM79Al8CPncmzdI0vo0xdTJUNge-lds0HR1dU5MU6b-3W32a4H25l8s5B853jQaVqTRCt5RM6JRH8n75MptvZpOQ0GhCBCVUXqBhSHkUcC7I4HTHASXi8wqNnduTbpgMo5gN0foVDjrDi8oXvsVrrcyuKnxhKrxpndcHnBuL343ztXGnP5R4BdWugZ12D3hrswaDw4Dn4DTe-CZrb9BlDqXT4989QtvHxcf8OVi9PS3ns1WgQh76QEghVARxFnLIZAxSRIrpFDSTYkpTkeaMMUVznvGuRhYD0VrmACrnLJVhzkZo2uc2VQ3tN5RlUtviALZNKElOaJJiD8pBckST9Gg6211vq635arTzyd40tuvlkjDiESdCEt6pWK9S1jhndf43u8d-li3PXKrwcMTmLRTlv94f_8CIig |
CitedBy_id | crossref_primary_10_1007_s10489_022_03274_0 crossref_primary_10_32604_cmes_2021_017491 |
ContentType | Journal Article |
Copyright | 2016. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
Copyright_xml | – notice: 2016. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
DBID | AAYXX CITATION 3V. 7XB 8FE 8FG 8FK 8G5 ABUWG AFKRA ARAPS AZQEC BENPR BGLVJ CCPQU DWQXO GNUQQ GUQSH HCIFZ JQ2 K7- M2O MBDVC P5Z P62 PHGZM PHGZT PIMPY PKEHL PQEST PQGLB PQQKQ PQUKI PRINS Q9U ADTOC UNPAY |
DOI | 10.14569/IJACSA.2016.071019 |
DatabaseName | CrossRef ProQuest Central (Corporate) ProQuest Central (purchase pre-March 2016) ProQuest SciTech Collection ProQuest Technology Collection ProQuest Central (Alumni) (purchase pre-March 2016) Research Library (Alumni) ProQuest Central (Alumni) ProQuest Central UK/Ireland Advanced Technologies & Aerospace Collection ProQuest Central Essentials - QC ProQuest Central ProQuest Technology Collection ProQuest One ProQuest Central Korea ProQuest Central Student ProQuest Research Library SciTech Premium Collection ProQuest Computer Science Collection Computer Science Database Research Library Research Library (Corporate) Advanced Technologies & Aerospace Database ProQuest Advanced Technologies & Aerospace Collection Proquest Central Premium ProQuest One Academic (New) ProQuest Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China ProQuest Central Basic Unpaywall for CDI: Periodical Content Unpaywall |
DatabaseTitle | CrossRef Publicly Available Content Database Research Library Prep Computer Science Database ProQuest Central Student Technology Collection ProQuest One Academic Middle East (New) ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College Research Library (Alumni Edition) ProQuest Central China ProQuest Central ProQuest One Applied & Life Sciences ProQuest Central Korea ProQuest Research Library ProQuest Central (New) Advanced Technologies & Aerospace Collection ProQuest Central Basic ProQuest One Academic Eastern Edition ProQuest Technology Collection ProQuest SciTech Collection Advanced Technologies & Aerospace Database ProQuest One Academic UKI Edition ProQuest One Academic ProQuest One Academic (New) ProQuest Central (Alumni) |
DatabaseTitleList | Publicly Available Content Database |
Database_xml | – sequence: 1 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository – sequence: 2 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISSN | 2156-5570 |
ExternalDocumentID | 10.14569/ijacsa.2016.071019 10_14569_IJACSA_2016_071019 |
Genre | Report Case Study |
GroupedDBID | .DC 5VS 8G5 AAYXX ABUWG ADMLS AFKRA ALMA_UNASSIGNED_HOLDINGS ARAPS AZQEC BENPR BGLVJ CCPQU CITATION DWQXO EBS EJD GNUQQ GUQSH HCIFZ K7- KQ8 M2O OK1 PHGZM PHGZT PIMPY PQGLB PUEGO RNS 3V. 7XB 8FE 8FG 8FK JQ2 MBDVC P62 PKEHL PQEST PQQKQ PQUKI PRINS Q9U ADTOC UNPAY |
ID | FETCH-LOGICAL-c252t-7977c6a8d25ad98a976c3ebae39741b7bf333c1f5d5557d8a0ee9faacf53b92f3 |
IEDL.DBID | BENPR |
ISSN | 2158-107X 2156-5570 |
IngestDate | Tue Aug 19 22:20:02 EDT 2025 Fri Jul 25 04:16:02 EDT 2025 Wed Oct 01 04:49:12 EDT 2025 Thu Apr 24 23:09:58 EDT 2025 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | true |
Issue | 10 |
Language | English |
License | cc-by |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c252t-7977c6a8d25ad98a976c3ebae39741b7bf333c1f5d5557d8a0ee9faacf53b92f3 |
Notes | ObjectType-Case Study-2 SourceType-Scholarly Journals-1 content type line 14 ObjectType-Report-1 |
OpenAccessLink | https://www.proquest.com/docview/2656507905?pq-origsite=%requestingapplication%&accountid=15518 |
PQID | 2656507905 |
PQPubID | 5444811 |
ParticipantIDs | unpaywall_primary_10_14569_ijacsa_2016_071019 proquest_journals_2656507905 crossref_primary_10_14569_IJACSA_2016_071019 crossref_citationtrail_10_14569_IJACSA_2016_071019 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 20160101 |
PublicationDateYYYYMMDD | 2016-01-01 |
PublicationDate_xml | – month: 01 year: 2016 text: 20160101 day: 01 |
PublicationDecade | 2010 |
PublicationPlace | West Yorkshire |
PublicationPlace_xml | – name: West Yorkshire |
PublicationTitle | International journal of advanced computer science & applications |
PublicationYear | 2016 |
Publisher | Science and Information (SAI) Organization Limited |
Publisher_xml | – name: Science and Information (SAI) Organization Limited |
SSID | ssj0000392683 |
Score | 1.9971102 |
Snippet | Named Entity Recognition and Classification is the process of identifying named entities and classifying them into one of the classes like person name,... |
SourceID | unpaywall proquest crossref |
SourceType | Open Access Repository Aggregation Database Enrichment Source Index Database |
SubjectTerms | Case studies |
SummonAdditionalLinks | – databaseName: Unpaywall dbid: UNPAY link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwvV1Lb9NAEB6V9AAXWl4itKA9cGQb2-v1o7cotCoVRFUhKJxWsw9LQOREdSIUfga_mNmsnQZVQuLCzZbWO5JnvPN969lvAF5r47JcaOQFakkERZS8qKThQmalpIQS4WYr-8M4u5ikl1M53YNuM4dAT4PhD_5bLxY_R9sMPm--03w8j6PBFS6Iu8clHyNlC3Xmz7Ku1XVXbDOvVVD6VgT5lO9329U-kdvetzuAREBtdQ_2M0mEuwf7k_HV8ItvQkdchntJqnDtZU_zaatURDijHHz9hqbxYkWxV_yMN_I8u9nsFqLeX9ULXP_A2WwnW50fwK_uzE8oUvl-slrqE_PzrgTk_3sRh_Cwxb5sGIL1Eey5-jEcdH0lWLvMPIHrjTEWjLEdYywYY2SM_WmMbY2dssmNXTFsGLIRpWjmKyXXT2FyfvZpdMHb3g_cJDJZ8pxwqcmwsIlEWxZIqMkIp9ERfkpjnetKCGHiSlpJHrMFRs6VFaKppNBlUoln0KvntXsOzEVInM8SddNpKlNHHFNbWlur0qC0qelD0vlQmVYY3ffnmClPkLzj1bvL4ejjUHnHq-D4PrzZPrQIuiB_H37cBYdqF4lGJYSlCY6XkewD3wbM3elC2O1O9-Ifxx_BA38Xto6Oobe8WbmXBKaW-lUb_r8BaIclOQ priority: 102 providerName: Unpaywall |
Title | Named Entity Recognition System for Postpositional Languages: Urdu as a Case Study |
URI | https://www.proquest.com/docview/2656507905 http://thesai.org/Downloads/Volume7No10/Paper_19-Named_Entity_Recognition_System_for_Postpositional_Languages.pdf |
UnpaywallVersion | publishedVersion |
Volume | 7 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
journalDatabaseRights | – providerCode: PRVAFT databaseName: Colorado Digital library customDbUrl: eissn: 2156-5570 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0000392683 issn: 2158-107X databaseCode: KQ8 dateStart: 20100101 isFulltext: true titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html providerName: Colorado Alliance of Research Libraries – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: http://www.proquest.com/pqcentral?accountid=15518 eissn: 2156-5570 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0000392683 issn: 2158-107X databaseCode: BENPR dateStart: 20100101 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1La9tAEB4S55Bc-kpCnaZhDz1WxNZq9QiE4Bq7qUmEcWJwTmL2IUgxsusHwf8-s9LKMRRy0mU1iJndmW9Gs98A_JDKhBGX6MUoBSUoPPHiXCiPizARFFBaWJay79PwdhwMJmKyB2l9F8a2VdY-sXTUeqZsjfzSJ-BB2CVpiZv5P89OjbJ_V-sRGuhGK-jrkmJsHw6sSw4acPCrlw5H26oLfYsfltycFOosr2k0cVREBCSSyz-DTvehYxu-LKVnu-Tf2Q1Xbxj0cF3McfOC0-lOOOp_gg8OR7JOZfjPsGeKL_CxntHA3JE9hlGKFO9Yz97G3bBR3S40K1jFVc4ItDI7sbfu3iKhd66Gubxi44VeM1wyZF0Kd8x2HW5OYNzvPXZvPTdHwVO-8FdeRBhPhRhrX6BOYiQEoriRaAiLBG0ZyZxzrtq50EKISMfYMibJEVUuuEz8nJ9Co5gV5isw00LKnzSlQTIIRGAoX5Oa_FSeKBQ6UE3wa3VlypGM21kX08wmG1bHWaXjzOo4q3TchJ_bl-YVx8b7y89rO2TuwC2zt-3RBG9rm__FPf9FtcRdcWfvi_sGR3ZxVXU5h8ZqsTbfCYes5AXsx_3fF26L0XOcDjtPr5FW2zk |
linkProvider | ProQuest |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9NAEB6V9FAuvBGBAnuAG1aTXa8fSBUKIVXSphEKjZSbmX1YAkVOqBNV-XP8NmbtdRoJqbfevSN5dj3fN-PZbwA-KG2jWCgMElSSEhSRBkkudSBklEoClA5WpezLSTSchedzOT-Av81dGNdW2cTEKlCbpXY18hNOxIO4S9qRX1Z_Ajc1yv1dbUZooB-tYE4riTF_sePCbm8ohStPR99ovz9yfja46g8DP2Ug0FzydRATA9IRJoZLNGmChM9aWIWWkDrsqljlQgjdzaWRUsYmwY61aY6ocylUynNBdh_AIdEOHrbg8Otg8n26q_LQu_Oo0gIlaHU6qvHcSx8RcUlPRue9_o-eazBzEqLdSu9nHx5vOe_Rpljh9gYXiz34O3sCjzxvZb36oD2FA1s8g8fNTAjmQ8RzmE6Q8JUN3O3fLZs27UnLgtXa6IxIMnMTgptuMTI69jXT8jObXZsNw5Ih6xO8MtfluH0Bs3vx6EtoFcvCvgJmO0j5mqG0S4WhDC3lh8pQXMxTjdKEug28cVemvai5m62xyFxy43yc1T7OnI-z2sdt-LRbtKo1Pe5-_LjZh8x_4GV2exzbEOz25n9zv36jLnHf3Ou7zb2Ho-HV5TgbjyYXb-ChW1hXfI6htb7e2LfEgdbqnT9oDH7e99n-B8v9FxU |
linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwvV1Lb9NAEB6V9AAXWl4itKA9cGQb2-v1o7cotCoVRFUhKJxWsw9LQOREdSIUfga_mNmsnQZVQuLCzZbWO5JnvPN969lvAF5r47JcaOQFakkERZS8qKThQmalpIQS4WYr-8M4u5ikl1M53YNuM4dAT4PhD_5bLxY_R9sMPm--03w8j6PBFS6Iu8clHyNlC3Xmz7Ku1XVXbDOvVVD6VgT5lO9329U-kdvetzuAREBtdQ_2M0mEuwf7k_HV8ItvQkdchntJqnDtZU_zaatURDijHHz9hqbxYkWxV_yMN_I8u9nsFqLeX9ULXP_A2WwnW50fwK_uzE8oUvl-slrqE_PzrgTk_3sRh_Cwxb5sGIL1Eey5-jEcdH0lWLvMPIHrjTEWjLEdYywYY2SM_WmMbY2dssmNXTFsGLIRpWjmKyXXT2FyfvZpdMHb3g_cJDJZ8pxwqcmwsIlEWxZIqMkIp9ERfkpjnetKCGHiSlpJHrMFRs6VFaKppNBlUoln0KvntXsOzEVInM8SddNpKlNHHFNbWlur0qC0qelD0vlQmVYY3ffnmClPkLzj1bvL4ejjUHnHq-D4PrzZPrQIuiB_H37cBYdqF4lGJYSlCY6XkewD3wbM3elC2O1O9-Ifxx_BA38Xto6Oobe8WbmXBKaW-lUb_r8BaIclOQ |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Named+Entity+Recognition+System+for+Postpositional+Languages%3A+Urdu+as+a+Case+Study&rft.jtitle=International+journal+of+advanced+computer+science+%26+applications&rft.au=Kamran%2C+Muhammad&rft.au=Mansoor%2C+Syed&rft.date=2016-01-01&rft.issn=2158-107X&rft.eissn=2156-5570&rft.volume=7&rft.issue=10&rft_id=info:doi/10.14569%2FIJACSA.2016.071019&rft.externalDBID=n%2Fa&rft.externalDocID=10_14569_IJACSA_2016_071019 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2158-107X&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2158-107X&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2158-107X&client=summon |