Curating GitHub for engineered software projects
Software forges like GitHub host millions of repositories. Software engineering researchers have been able to take advantage of such a large corpora of potential study subjects with the help of tools like GHTorrent and Boa. However, the simplicity in querying comes with a caveat: there are limited m...
Saved in:
| Published in | Empirical software engineering : an international journal Vol. 22; no. 6; pp. 3219 - 3253 |
|---|---|
| Main Authors | , , , |
| Format | Journal Article |
| Language | English |
| Published |
New York
Springer US
01.12.2017
Springer Nature B.V |
| Subjects | |
| Online Access | Get full text |
| ISSN | 1382-3256 1573-7616 |
| DOI | 10.1007/s10664-017-9512-6 |
Cover
| Abstract | Software forges like GitHub host millions of repositories. Software engineering researchers have been able to take advantage of such a large corpora of potential study subjects with the help of tools like GHTorrent and Boa. However, the simplicity in querying comes with a caveat: there are limited means of separating the signal (e.g. repositories containing engineered software projects) from the noise (e.g. repositories containing home work assignments). The proportion of noise in a random sample of repositories could skew the study and may lead to researchers reaching unrealistic, potentially inaccurate, conclusions. We argue that it is imperative to have the ability to sieve out the noise in such large repository forges. We propose a framework, and present a reference implementation of the framework as a tool called
reaper
, to enable researchers to select GitHub repositories that contain evidence of an engineered software project. We identify software engineering practices (called dimensions) and propose means for validating their existence in a GitHub repository. We used
reaper
to measure the dimensions of 1,857,423 GitHub repositories. We then used manually classified data sets of repositories to train classifiers capable of predicting if a given GitHub repository contains an engineered software project. The performance of the classifiers was evaluated using a set of 200 repositories with known ground truth classification. We also compared the performance of the classifiers to other approaches to classification (e.g. number of GitHub Stargazers) and found our classifiers to outperform existing approaches. We found stargazers-based classifier (with 10 as the threshold for number of stargazers) to exhibit high precision (97%) but an inversely proportional recall (32%). On the other hand, our best classifier exhibited a high precision (82%) and a high recall (86%). The stargazer-based criteria offers precision but fails to recall a significant portion of the population. |
|---|---|
| AbstractList | Software forges like GitHub host millions of repositories. Software engineering researchers have been able to take advantage of such a large corpora of potential study subjects with the help of tools like GHTorrent and Boa. However, the simplicity in querying comes with a caveat: there are limited means of separating the signal (e.g. repositories containing engineered software projects) from the noise (e.g. repositories containing home work assignments). The proportion of noise in a random sample of repositories could skew the study and may lead to researchers reaching unrealistic, potentially inaccurate, conclusions. We argue that it is imperative to have the ability to sieve out the noise in such large repository forges. We propose a framework, and present a reference implementation of the framework as a tool called
reaper
, to enable researchers to select GitHub repositories that contain evidence of an engineered software project. We identify software engineering practices (called dimensions) and propose means for validating their existence in a GitHub repository. We used
reaper
to measure the dimensions of 1,857,423 GitHub repositories. We then used manually classified data sets of repositories to train classifiers capable of predicting if a given GitHub repository contains an engineered software project. The performance of the classifiers was evaluated using a set of 200 repositories with known ground truth classification. We also compared the performance of the classifiers to other approaches to classification (e.g. number of GitHub Stargazers) and found our classifiers to outperform existing approaches. We found stargazers-based classifier (with 10 as the threshold for number of stargazers) to exhibit high precision (97%) but an inversely proportional recall (32%). On the other hand, our best classifier exhibited a high precision (82%) and a high recall (86%). The stargazer-based criteria offers precision but fails to recall a significant portion of the population. Software forges like GitHub host millions of repositories. Software engineering researchers have been able to take advantage of such a large corpora of potential study subjects with the help of tools like GHTorrent and Boa. However, the simplicity in querying comes with a caveat: there are limited means of separating the signal (e.g. repositories containing engineered software projects) from the noise (e.g. repositories containing home work assignments). The proportion of noise in a random sample of repositories could skew the study and may lead to researchers reaching unrealistic, potentially inaccurate, conclusions. We argue that it is imperative to have the ability to sieve out the noise in such large repository forges. We propose a framework, and present a reference implementation of the framework as a tool called reaper, to enable researchers to select GitHub repositories that contain evidence of an engineered software project. We identify software engineering practices (called dimensions) and propose means for validating their existence in a GitHub repository. We used reaper to measure the dimensions of 1,857,423 GitHub repositories. We then used manually classified data sets of repositories to train classifiers capable of predicting if a given GitHub repository contains an engineered software project. The performance of the classifiers was evaluated using a set of 200 repositories with known ground truth classification. We also compared the performance of the classifiers to other approaches to classification (e.g. number of GitHub Stargazers) and found our classifiers to outperform existing approaches. We found stargazers-based classifier (with 10 as the threshold for number of stargazers) to exhibit high precision (97%) but an inversely proportional recall (32%). On the other hand, our best classifier exhibited a high precision (82%) and a high recall (86%). The stargazer-based criteria offers precision but fails to recall a significant portion of the population. |
| Author | Kroh, Steven Cabrey, Craig Munaiah, Nuthan Nagappan, Meiyappan |
| Author_xml | – sequence: 1 givenname: Nuthan orcidid: 0000-0003-2071-664X surname: Munaiah fullname: Munaiah, Nuthan email: nm6061@rit.edu organization: Department of Software Engineering, Rochester Institute of Technology – sequence: 2 givenname: Steven surname: Kroh fullname: Kroh, Steven organization: Department of Software Engineering, Rochester Institute of Technology – sequence: 3 givenname: Craig surname: Cabrey fullname: Cabrey, Craig organization: Department of Software Engineering, Rochester Institute of Technology – sequence: 4 givenname: Meiyappan surname: Nagappan fullname: Nagappan, Meiyappan organization: David R. Cheriton School of Computer Science, University of Waterloo |
| BookMark | eNp9kEFLAzEQhYNUsK3-AG8LnqOTzSbZPUrRVih40XPI7k7KlprUJIv4701ZQRD0NHN437w3b0Fmzjsk5JrBLQNQd5GBlBUFpmgjWEnlGZkzoThVkslZ3nldUl4KeUEWMe4BoFGVmBNYjcGkwe2K9ZA2Y1tYHwp0u8EhBuyL6G36MAGLY_B77FK8JOfWHCJefc8leX18eFlt6PZ5_bS639Ku4iLRBhGkYox1aIxoa9v0su4ZtL1UBq0sK7TAqpbXVilEhU2vhDGVbbmsO7B8SW6mu9n4fcSY9N6PwWVLXYLkIqfnkFVqUnXBxxjQ6m5I-R_vUjDDQTPQp3r0VI_O9ehTPVpmkv0ij2F4M-HzX6acmJi1bofhJ9Pf0BcCq3kV |
| CitedBy_id | crossref_primary_10_1016_j_infsof_2024_107425 crossref_primary_10_1016_j_infsof_2021_106699 crossref_primary_10_7232_JKIIE_2024_50_2_109 crossref_primary_10_1016_j_eswa_2020_114176 crossref_primary_10_1007_s10664_020_09905_9 crossref_primary_10_1007_s10664_024_10568_z crossref_primary_10_1109_TSE_2023_3244123 crossref_primary_10_1109_ACCESS_2023_3244495 crossref_primary_10_1109_TSE_2022_3224053 crossref_primary_10_1016_j_infsof_2023_107317 crossref_primary_10_1109_TSE_2019_2960357 crossref_primary_10_1016_j_jss_2020_110821 crossref_primary_10_1145_3428236 crossref_primary_10_1007_s10664_021_10058_6 crossref_primary_10_1109_TSE_2021_3063220 crossref_primary_10_3390_fi10030029 crossref_primary_10_1089_hs_2020_0194 crossref_primary_10_1007_s10664_020_09825_8 crossref_primary_10_1109_TSE_2019_2918536 crossref_primary_10_1109_TSE_2021_3051492 crossref_primary_10_1145_3555208 crossref_primary_10_1016_j_jss_2023_111806 crossref_primary_10_1007_s10664_018_9650_5 crossref_primary_10_1142_S0218194023500110 crossref_primary_10_1109_TSE_2021_3073773 crossref_primary_10_1007_s10515_019_00254_6 crossref_primary_10_3390_a14100289 crossref_primary_10_1007_s10664_023_10421_9 crossref_primary_10_1145_3467895 crossref_primary_10_1007_s10664_021_10033_1 crossref_primary_10_1007_s11432_022_3953_4 crossref_primary_10_1109_TSE_2023_3265962 crossref_primary_10_1109_MS_2020_3041241 crossref_primary_10_7717_peerj_cs_372 crossref_primary_10_1109_TSE_2023_3278129 crossref_primary_10_1007_s10664_023_10313_y crossref_primary_10_1016_j_infsof_2024_107450 crossref_primary_10_1007_s10664_023_10418_4 crossref_primary_10_1007_s10664_019_09796_5 crossref_primary_10_3390_app11167250 crossref_primary_10_1016_j_entcom_2024_100832 crossref_primary_10_1109_ACCESS_2019_2933318 crossref_primary_10_1016_j_jss_2018_02_041 crossref_primary_10_1007_s10664_023_10305_y crossref_primary_10_1007_s10664_020_09841_8 crossref_primary_10_1016_j_jss_2025_112438 crossref_primary_10_1109_TSE_2022_3143766 crossref_primary_10_1145_3503509 crossref_primary_10_1007_s10664_021_10068_4 crossref_primary_10_1145_3660823 crossref_primary_10_1016_j_jss_2021_110936 crossref_primary_10_1007_s10664_021_09959_3 crossref_primary_10_1145_3680463 crossref_primary_10_1145_3583565 crossref_primary_10_1145_3241743 crossref_primary_10_1007_s10664_018_9659_9 crossref_primary_10_1007_s10664_022_10171_0 crossref_primary_10_1007_s10664_021_10078_2 crossref_primary_10_1002_smr_2395 crossref_primary_10_1142_S0218194022500723 crossref_primary_10_1007_s10796_020_10100_w crossref_primary_10_1145_3408897 crossref_primary_10_1007_s10515_021_00314_w crossref_primary_10_1007_s10515_024_00469_2 crossref_primary_10_1109_TSE_2020_2986415 crossref_primary_10_1080_07421222_2019_1705514 crossref_primary_10_1109_MS_2020_3036758 crossref_primary_10_1007_s10664_024_10606_w crossref_primary_10_1016_j_jss_2020_110863 crossref_primary_10_1007_s10664_023_10373_0 crossref_primary_10_1109_ACCESS_2021_3116061 crossref_primary_10_1007_s10664_024_10482_4 crossref_primary_10_1007_s40747_022_00848_w crossref_primary_10_1109_TSE_2021_3087419 crossref_primary_10_1109_TASE_2020_3035620 crossref_primary_10_1145_3579639 crossref_primary_10_1007_s10664_023_10328_5 crossref_primary_10_1016_j_infsof_2021_106791 crossref_primary_10_1007_s10664_023_10382_z crossref_primary_10_1007_s11334_021_00422_6 crossref_primary_10_1007_s10515_022_00364_8 crossref_primary_10_1145_3688841 crossref_primary_10_1016_j_infsof_2024_107404 crossref_primary_10_1109_TSE_2020_2978819 crossref_primary_10_1007_s10664_024_10462_8 crossref_primary_10_1109_TSE_2021_3088759 crossref_primary_10_7717_peerj_cs_2429 crossref_primary_10_1016_j_jss_2024_112101 crossref_primary_10_1007_s10664_022_10261_z crossref_primary_10_1007_s10664_024_10523_y crossref_primary_10_1016_j_jss_2022_111458 crossref_primary_10_1007_s13748_018_0147_7 crossref_primary_10_1007_s10664_024_10584_z crossref_primary_10_1109_TSE_2024_3408448 crossref_primary_10_1016_j_jss_2020_110704 crossref_primary_10_1016_j_jss_2018_09_016 crossref_primary_10_1007_s10664_021_09961_9 crossref_primary_10_1007_s10664_020_09926_4 crossref_primary_10_1007_s10664_018_9641_6 crossref_primary_10_1007_s10664_025_10626_0 crossref_primary_10_1145_3433928 crossref_primary_10_1016_j_procs_2019_12_042 crossref_primary_10_1007_s10664_020_09847_2 crossref_primary_10_1109_ACCESS_2023_3314572 crossref_primary_10_1002_smr_2441 crossref_primary_10_1007_s10664_023_10368_x crossref_primary_10_1007_s11219_021_09564_z crossref_primary_10_1016_j_respol_2024_105008 |
| Cites_doi | 10.1109/32.895984 10.1201/9781420006742 10.1145/337180.337209 10.1145/267580.267590 10.1145/2635868.2635922 10.1109/ICSME.2014.45 10.1109/ICSE.2015.245 10.1145/2814189.2815369 10.1109/CSMR.2013.48 10.1145/2597073.2597118 10.1023/A:1010933404324 10.1007/978-3-540-71301-2_29 10.1007/978-3-642-10294-3_1 10.1145/2597073.2597074 10.1109/APNOMS.2014.6996525 10.1147/sj.153.0225 10.1109/ICSME.2014.62 10.1109/ICECCS.2013.42 10.1109/COMPSAC.2013.55 10.1109/ICST.2008.47 10.1007/978-3-319-13734-6_6 10.1145/2025113.2025119 10.1109/ISSRE.2013.6698918 10.1109/METRIC.1997.637174 10.1109/32.935855 10.1109/ISSRE.2005.35 10.1145/1085313.1085331 |
| ContentType | Journal Article |
| Copyright | Springer Science+Business Media New York 2017 Copyright Springer Science & Business Media 2017 |
| Copyright_xml | – notice: Springer Science+Business Media New York 2017 – notice: Copyright Springer Science & Business Media 2017 |
| DBID | AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D |
| DOI | 10.1007/s10664-017-9512-6 |
| DatabaseName | CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
| DatabaseTitle | CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional |
| DatabaseTitleList | Computer and Information Systems Abstracts |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 1573-7616 |
| EndPage | 3253 |
| ExternalDocumentID | 10_1007_s10664_017_9512_6 |
| GroupedDBID | -4Z -59 -5G -BR -EM -Y2 -~C .86 .DC .VR 06D 0R~ 0VY 199 1N0 1SB 2.D 203 28- 29G 2J2 2JN 2JY 2KG 2LR 2P1 2VQ 2~H 30V 4.4 406 408 409 40D 40E 5GY 5QI 5VS 67Z 6NX 78A 8FE 8FG 8TC 8UJ 95- 95. 95~ 96X AABHQ AACDK AAHNG AAIAL AAJBT AAJKR AANZL AAOBN AARHV AARTL AASML AATNV AATVU AAUYE AAWCG AAYIU AAYOK AAYQN AAYTO AAYZH ABAKF ABBBX ABBXA ABDZT ABECU ABFTD ABFTV ABHLI ABHQN ABJCF ABJNI ABJOX ABKCH ABKTR ABMNI ABMQK ABNWP ABQBU ABQSL ABSXP ABTEG ABTHY ABTKH ABTMW ABULA ABWNU ABXPI ACAOD ACBXY ACDTI ACGFS ACHSB ACHXU ACIWK ACKNC ACMDZ ACMLO ACOKC ACOMO ACPIV ACSNA ACZOJ ADHHG ADHIR ADIMF ADINQ ADKNI ADKPE ADRFC ADTPH ADURQ ADYFF ADZKW AEBTG AEFIE AEFQL AEGAL AEGNC AEJHL AEJRE AEKMD AEMSY AENEX AEOHA AEPYU AESKC AETLH AEVLU AEXYK AFBBN AFEXP AFGCZ AFKRA AFLOW AFQWF AFWTZ AFZKB AGAYW AGDGC AGGDS AGJBK AGMZJ AGQEE AGQMX AGRTI AGWIL AGWZB AGYKE AHAVH AHBYD AHKAY AHSBF AHYZX AIAKS AIGIU AIIXL AILAN AITGF AJBLW AJRNO AJZVZ ALMA_UNASSIGNED_HOLDINGS ALWAN AMKLP AMXSW AMYLF AMYQR AOCGG ARAPS ARMRJ ASPBG AVWKF AXYYD AYJHY AZFZN B-. BA0 BBWZM BDATZ BENPR BGLVJ BGNMA BSONS CAG CCPQU COF CS3 CSCUP DDRTE DL5 DNIVK DPUIP DU5 EBLON EBS EIOEI EJD ESBYG FEDTE FERAY FFXSO FIGPU FINBP FNLPD FRRFC FSGXE FWDCC GGCAI GGRSB GJIRD GNWQR GQ6 GQ7 GQ8 GXS H13 HCIFZ HF~ HG5 HG6 HMJXF HQYDN HRMNR HVGLF HZ~ I09 IHE IJ- IKXTQ ITM IWAJR IXC IZIGR IZQ I~X I~Z J-C J0Z JBSCW JCJTX JZLTJ KDC KOV KOW L6V LAK LLZTM M4Y M7S MA- N2Q NB0 NDZJH NPVJJ NQJWS NU0 O9- O93 O9G O9I O9J OAM P19 P62 P9O PF0 PT4 PT5 PTHSS Q2X QOK QOS R4E R89 R9I RHV RNI RNS ROL RPX RSV RZC RZE RZK S0W S16 S1Z S26 S27 S28 S3B SAP SCJ SCLPG SCO SDH SDM SHX SISQX SJYHP SNE SNPRN SNX SOHCF SOJ SPISZ SRMVM SSLCW STPWE SZN T13 T16 TSG TSK TSV TUC U2A UG4 UOJIU UTJUX UZXMN VC2 VFIZW W23 W48 WK8 YLTOR Z45 Z7R Z7S Z7V Z7X Z7Z Z81 Z83 Z86 Z88 Z8M Z8N Z8P Z8R Z8T Z8U Z8W Z92 ZMTXR ~EX AAPKM AAYXX ABBRH ABDBE ABFSG ABRTQ ACSTC ADHKG AEZWR AFDZB AFHIU AFOHR AGQPQ AHPBZ AHWEU AIXLP ATHPR AYFIA CITATION PHGZM PHGZT PQGLB PUEGO 7SC 8FD JQ2 L7M L~C L~D |
| ID | FETCH-LOGICAL-c435t-9ee067111ceaa5b8f9d68d10bd67aef624ef014b38f77ee7e9d75aa4fb368c0f3 |
| IEDL.DBID | U2A |
| ISSN | 1382-3256 |
| IngestDate | Fri Jul 25 12:26:23 EDT 2025 Wed Oct 01 04:12:35 EDT 2025 Thu Apr 24 23:10:48 EDT 2025 Fri Feb 21 02:35:47 EST 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 6 |
| Keywords | Data curation Curation tools GitHub Mining software repositories |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c435t-9ee067111ceaa5b8f9d68d10bd67aef624ef014b38f77ee7e9d75aa4fb368c0f3 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ORCID | 0000-0003-2071-664X |
| PQID | 2063574530 |
| PQPubID | 326341 |
| PageCount | 35 |
| ParticipantIDs | proquest_journals_2063574530 crossref_citationtrail_10_1007_s10664_017_9512_6 crossref_primary_10_1007_s10664_017_9512_6 springer_journals_10_1007_s10664_017_9512_6 |
| ProviderPackageCode | CITATION AAYXX |
| PublicationCentury | 2000 |
| PublicationDate | 2017-12-01 |
| PublicationDateYYYYMMDD | 2017-12-01 |
| PublicationDate_xml | – month: 12 year: 2017 text: 2017-12-01 day: 01 |
| PublicationDecade | 2010 |
| PublicationPlace | New York |
| PublicationPlace_xml | – name: New York – name: Dordrecht |
| PublicationSubtitle | An International Journal |
| PublicationTitle | Empirical software engineering : an international journal |
| PublicationTitleAbbrev | Empir Software Eng |
| PublicationYear | 2017 |
| Publisher | Springer US Springer Nature B.V |
| Publisher_xml | – name: Springer US – name: Springer Nature B.V |
| References | Munaiah N, Kroh S, Cabrey C, Nagappan M (2016a) Home of the reporeapers. https://reporeapers.github.io, accessed: 2016-03-11 BreimanLRandom forestsMach Learn200145153210.1023/A:10109334043241007.68152 Dyer R, Nguyen HA, Rajan H, Nguyen TN (2013) Boa: a language and infrastructure for analyzing ultra-large-scale software repositories Proceedings of the 2013 international conference on software engineering, IEEE Press, Piscataway, NJ, USA, ICSE ’13, pp 422–431. http://dl.acm.org/citation.cfm?id=2486788.2486844 Gousios G (2013) The GHTorrent dataset and tool suite Proceedings of the 10th working conference on mining software repositories, IEEE Press, Piscataway, NJ, USA, MSR ’13, pp 233–236. http://dl.acm.org/citation.cfm?id=2487085.2487132 Software Freedom Law Center (2012) Managing copyright information within a free software project - software freedom law center. http://softwarefreedom.org/resources/2012/managingcopyrightinformation.html, accessed: 2015-05-15 GitHub Inc (2016b) Github archive. https://www.githubarchive.org/, accessed: 2016-06-19 GHTorrent (2016b) The relational DB schema. http://ghtorrent.org/relational.html, accessed: 2016-03-11 Carlo Z (2016) Githut - programming languages and GitHub. http://githut.info, accessed: 2016-03-11 GitHub Inc (2016c) No license - choose a license. http://choosealicense.com/no-license/, accessed: 2016-03-11 Tung YH, Chuang CJ, Shan HL (2014) A framework of code reuse in open source software The 16th asia-pacific network operations and management symposium, pp 1–6. doi:10.1109/APNOMS.2014.6996525 JarczykOGruszkaBJaroszewiczSBukowskiLWierzbickiAGithub projects. Quality analysis of open-source software2014ChamSpringer International Publishing8094 Allamanis M, Sutton C (2013) Mining source code repositories at massive scale using language modeling Proceedings of the 10th working conference on mining software repositories, IEEE Press, Piscataway, NJ, USA, MSR ’13, pp 207–216. http://dl.acm.org/citation.cfm?id=2487085.2487127 GitHub Inc (2016a) Github API v3—github developer guide. https://developer.github.com/v3/, accessed: 2016-03-11 Mockus A, Fielding R T, Herbsleb J (2000) A case study of open source software development: the apache server Proceedings of the 2000 international conference on software engineering. ICSE 2000 the new millennium, pp 263–272. doi:10.1145/337180.337209 Codetree Studios (2016) Codetree - GitHub issues, managed. https://codetree.com/, accessed: 2016-03-11 Nagappan N, Williams L, Osborne J, Vouk M, Abrahamsson P (2005) Providing test quality feedback using static source code and automatic test suite metrics 16th IEEE international symposium on software reliability engineering (ISSRE’05), pp 10–94. doi:10.1109/ISSRE.2005.35 Bird C, Nagappan N, Murphy B, Gall H, Devanbu P (2011) Don’t touch my code!: examining the effects of ownership on software quality Proceedings of the 19th ACM SIGSOFT symposium and the 13th european conference on foundations of software engineering, ACM, New York, NY, USA, ESEC/FSE ’11, pp 4–14. doi:10.1145/2025113.2025119 Bissyandé TF, Lo D, Jiang L, Réveillère L, Klein J, Traon YL (2013) Got issues? Who cares about it? A large scale investigation of issue trackers from GitHub 2013 IEEE 24th international symposium on software reliability engineering (ISSRE), pp 188–197. doi:10.1109/ISSRE.2013.6698918 Zaidman A, Rompaey BV, Demeyer S, v Deursen A (2008) Mining software repositories to study co-evolution of production & test code 2008 1st international conference on software testing, verification, and validation, pp 220–229. doi:10.1109/ICST.2008.47 EmamKEBenlarbiSGoelNRaiSNThe confounding effect of class size on the validity of object-oriented metricsIEEE Trans Softw Eng200127763065010.1109/32.935855 de Souza CB, Anquetil N, de Oliveira KM (2005) A study of the documentation essential to software maintenance Proceedings of the 23rd annual international conference on design of communication: documenting & designing for pervasive information, ACM, New York, NY, USA, SIGDOC ’05, pp 68–75. doi:10.1145/1085313.1085331 Iowa State University (2016) Publications related to Boa - Boa - Iowa State University. http://boa.cs.iastate.edu/papers/, accessed: 2016-03-11 Syer MD, Nagappan M, Hassan AE, Adams B (2013) Revisiting prior empirical findings for mobile apps: an empirical case study on the 15 most popular open-source android apps Proceedings of the 2013 conference of the center for advanced studies on collaborative research, IBM corp., riverton, NJ, USA, CASCON ’13, pp 283–297. http://dl.acm.org/citation.cfm?id=2555523.2555553 Kalliamvakou E, Gousios G, Blincoe K, Singer L, German DM, Damian D (2014) The promises and perils of mining GitHub Proceedings of the 11th working conference on mining software repositories, ACM, New York, NY, USA, MSR 2014, pp 92–101. doi:10.1145/2597073.2597074 WhiteheadJMistríkIGrundyJvan der HoekACollaborative software engineering: concepts and techniques2010BerlinSpringer13010.1007/978-3-642-10294-3_11183.68199 Bissyandé TF, Thung F, Lo D, Jiang L, Réveillère L (2013a) Orion: a software project search engine with integrated diverse software artifacts 2013 18th international conference on engineering of complex computer systems, pp 242–245. doi:10.1109/ICECCS.2013.42 Ray B, Posnett D, Filkov V, Devanbu P (2014) A large scale study of programming languages and code quality in github Proceedings of the 22nd ACM SIGSOFT international symposium on foundations of software engineering, ACM, New York, NY, USA, FSE 2014, pp 155–165. doi:10.1145/2635868.2635922 Rosenberg J (1997) Some misconceptions about lines of code Proceedings fourth international software metrics symposium, pp 137–142. doi:10.1109/METRIC.1997.637174 Danial A (2014) CLOC – Count lines of code. http://cloc.sourceforge.net/, accessed: 2016-03-11, version: 1.62 Guzman E, Azócar D, Li Y (2014) Sentiment analysis of commit comments in GitHub: an empirical study Proceedings of the 11th working conference on mining software repositories, ACM, New York, NY, USA, MSR 2014, pp 352–355. doi:10.1145/2597073.2597118 Kofink A (2015) Contributions of the under-appreciated: gender bias in an open-source ecology Companion proceedings of the 2015 ACM SIGPLAN international conference on systems, programming, languages and applications: Software for humanity, ACM, New York, NY, USA, SPLASH Companion 2015, pp 83–84. doi:10.1145/2814189.2815369 Munaiah N, Kroh S, Cabrey C, Parikh N (2016b) Reaper - reference implementation. https://github.com/reporeapers/reaper, accessed: 2016-03-11 GHTorrent (2016a) Hall of fame. http://ghtorrent.org/halloffame.html, accessed: 2016-03-11 HuBoard Inc (2016) Huboard - github issues made awesome. https://huboard.com/, accessed: 2016-03-11 Laplante P (2007) What every engineer should know about software engineering. What every engineer should know. CRC Press Vendome C (2015) A large scale study of license usage on GitHub 2015 IEEE/ACM 37th IEEE international conference on software engineering, vol 2, pp 772–774. doi:10.1109/ICSE.2015.245 Zenhub (2016) Zenhub - project management for agile teams on GitHub. https://www.zenhub.io/, accessed: 2016-03-11 Bissyandé TF, Thung F, Lo D, Jiang L, Réveillère L (2013b) Popularity, interoperability, and impact of programming languages in 100,000 open source projects 2013 IEEE 37th annual computer software and applications conference, pp 303–312. doi:10.1109/COMPSAC.2013.55 CA Technologies (2016) Waffle.io - work better on GitHub issues. https://waffle.io/, accessed: 2016-03-11 Kochhar PS, Bissyandé TF, Lo D, Jiang L (2013) Adoption of software testing in open source projects–a preliminary study on 50,000 projects 2013 17th european conference on software maintenance and reengineering, pp 353–356. doi:10.1109/CSMR.2013.48 NagappanNPotential of open source systems as project repositories for empirical studies working group results2007BerlinSpringer103107 Sajnani H, Saini V, Ossher J, Lopes CV (2014) Is popularity a measure of quality? an analysis of maven components 2014 IEEE international conference on software maintenance and evolution, pp 231–240. doi:10.1109/ICSME.2014.45 Vasilescu B, van Schuylenburg S, Wulms J, Serebrenik A, van den Brand MGJ (2014) Continuous integration in a social-coding world empirical evidence from GitHub 2014 IEEE international conference on software maintenance and evolution, pp 401–405. doi:10.1109/ICSME.2014.62 BeladyLALehmanMMA model of large program developmentIBM Syst J197615322525210.1147/sj.153.02250329.68014 RossSMPeirce’s criterion for the elimination of suspect experimental dataJ Eng Technol20032023841 ZhuHHallPAVMayJHRSoftware unit test coverage and adequacyACM Comput Surv199729436642710.1145/267580.267590 EickSGGravesTLKarrAFMarronJSMockusADoes code decay? Assessing the evidence from change management dataIEEE Trans Softw Eng200127111210.1109/32.895984 9512_CR19 9512_CR18 9512_CR11 9512_CR10 9512_CR32 9512_CR13 9512_CR35 9512_CR12 9512_CR34 9512_CR36 9512_CR17 9512_CR39 9512_CR16 9512_CR38 9512_CR31 9512_CR30 N Nagappan (9512_CR33) 2007 SM Ross (9512_CR37) 2003; 20 KE Emam (9512_CR15) 2001; 27 9512_CR29 LA Belady (9512_CR2) 1976; 15 H Zhu (9512_CR47) 1997; 29 9512_CR22 O Jarczyk (9512_CR25) 2014 9512_CR21 9512_CR43 9512_CR24 9512_CR46 9512_CR23 9512_CR45 9512_CR26 L Breiman (9512_CR7) 2001; 45 9512_CR28 9512_CR27 9512_CR8 9512_CR6 SG Eick (9512_CR14) 2001; 27 J Whitehead (9512_CR44) 2010 9512_CR5 9512_CR4 9512_CR40 9512_CR3 9512_CR20 9512_CR42 9512_CR1 9512_CR41 9512_CR9 |
| References_xml | – reference: GitHub Inc (2016c) No license - choose a license. http://choosealicense.com/no-license/, accessed: 2016-03-11 – reference: Gousios G (2013) The GHTorrent dataset and tool suite Proceedings of the 10th working conference on mining software repositories, IEEE Press, Piscataway, NJ, USA, MSR ’13, pp 233–236. http://dl.acm.org/citation.cfm?id=2487085.2487132 – reference: JarczykOGruszkaBJaroszewiczSBukowskiLWierzbickiAGithub projects. Quality analysis of open-source software2014ChamSpringer International Publishing8094 – reference: Munaiah N, Kroh S, Cabrey C, Parikh N (2016b) Reaper - reference implementation. https://github.com/reporeapers/reaper, accessed: 2016-03-11 – reference: GitHub Inc (2016a) Github API v3—github developer guide. https://developer.github.com/v3/, accessed: 2016-03-11 – reference: WhiteheadJMistríkIGrundyJvan der HoekACollaborative software engineering: concepts and techniques2010BerlinSpringer13010.1007/978-3-642-10294-3_11183.68199 – reference: Iowa State University (2016) Publications related to Boa - Boa - Iowa State University. http://boa.cs.iastate.edu/papers/, accessed: 2016-03-11 – reference: CA Technologies (2016) Waffle.io - work better on GitHub issues. https://waffle.io/, accessed: 2016-03-11 – reference: GHTorrent (2016a) Hall of fame. http://ghtorrent.org/halloffame.html, accessed: 2016-03-11 – reference: Vendome C (2015) A large scale study of license usage on GitHub 2015 IEEE/ACM 37th IEEE international conference on software engineering, vol 2, pp 772–774. doi:10.1109/ICSE.2015.245 – reference: Laplante P (2007) What every engineer should know about software engineering. What every engineer should know. CRC Press – reference: Zenhub (2016) Zenhub - project management for agile teams on GitHub. https://www.zenhub.io/, accessed: 2016-03-11 – reference: Munaiah N, Kroh S, Cabrey C, Nagappan M (2016a) Home of the reporeapers. https://reporeapers.github.io, accessed: 2016-03-11 – reference: Nagappan N, Williams L, Osborne J, Vouk M, Abrahamsson P (2005) Providing test quality feedback using static source code and automatic test suite metrics 16th IEEE international symposium on software reliability engineering (ISSRE’05), pp 10–94. doi:10.1109/ISSRE.2005.35 – reference: Syer MD, Nagappan M, Hassan AE, Adams B (2013) Revisiting prior empirical findings for mobile apps: an empirical case study on the 15 most popular open-source android apps Proceedings of the 2013 conference of the center for advanced studies on collaborative research, IBM corp., riverton, NJ, USA, CASCON ’13, pp 283–297. http://dl.acm.org/citation.cfm?id=2555523.2555553 – reference: de Souza CB, Anquetil N, de Oliveira KM (2005) A study of the documentation essential to software maintenance Proceedings of the 23rd annual international conference on design of communication: documenting & designing for pervasive information, ACM, New York, NY, USA, SIGDOC ’05, pp 68–75. doi:10.1145/1085313.1085331 – reference: GHTorrent (2016b) The relational DB schema. http://ghtorrent.org/relational.html, accessed: 2016-03-11 – reference: RossSMPeirce’s criterion for the elimination of suspect experimental dataJ Eng Technol20032023841 – reference: Tung YH, Chuang CJ, Shan HL (2014) A framework of code reuse in open source software The 16th asia-pacific network operations and management symposium, pp 1–6. doi:10.1109/APNOMS.2014.6996525 – reference: Carlo Z (2016) Githut - programming languages and GitHub. http://githut.info, accessed: 2016-03-11 – reference: Bird C, Nagappan N, Murphy B, Gall H, Devanbu P (2011) Don’t touch my code!: examining the effects of ownership on software quality Proceedings of the 19th ACM SIGSOFT symposium and the 13th european conference on foundations of software engineering, ACM, New York, NY, USA, ESEC/FSE ’11, pp 4–14. doi:10.1145/2025113.2025119 – reference: Danial A (2014) CLOC – Count lines of code. http://cloc.sourceforge.net/, accessed: 2016-03-11, version: 1.62 – reference: Zaidman A, Rompaey BV, Demeyer S, v Deursen A (2008) Mining software repositories to study co-evolution of production & test code 2008 1st international conference on software testing, verification, and validation, pp 220–229. doi:10.1109/ICST.2008.47 – reference: Bissyandé TF, Thung F, Lo D, Jiang L, Réveillère L (2013b) Popularity, interoperability, and impact of programming languages in 100,000 open source projects 2013 IEEE 37th annual computer software and applications conference, pp 303–312. doi:10.1109/COMPSAC.2013.55 – reference: Kalliamvakou E, Gousios G, Blincoe K, Singer L, German DM, Damian D (2014) The promises and perils of mining GitHub Proceedings of the 11th working conference on mining software repositories, ACM, New York, NY, USA, MSR 2014, pp 92–101. doi:10.1145/2597073.2597074 – reference: Bissyandé TF, Thung F, Lo D, Jiang L, Réveillère L (2013a) Orion: a software project search engine with integrated diverse software artifacts 2013 18th international conference on engineering of complex computer systems, pp 242–245. doi:10.1109/ICECCS.2013.42 – reference: EmamKEBenlarbiSGoelNRaiSNThe confounding effect of class size on the validity of object-oriented metricsIEEE Trans Softw Eng200127763065010.1109/32.935855 – reference: Kochhar PS, Bissyandé TF, Lo D, Jiang L (2013) Adoption of software testing in open source projects–a preliminary study on 50,000 projects 2013 17th european conference on software maintenance and reengineering, pp 353–356. doi:10.1109/CSMR.2013.48 – reference: BeladyLALehmanMMA model of large program developmentIBM Syst J197615322525210.1147/sj.153.02250329.68014 – reference: Allamanis M, Sutton C (2013) Mining source code repositories at massive scale using language modeling Proceedings of the 10th working conference on mining software repositories, IEEE Press, Piscataway, NJ, USA, MSR ’13, pp 207–216. http://dl.acm.org/citation.cfm?id=2487085.2487127 – reference: Guzman E, Azócar D, Li Y (2014) Sentiment analysis of commit comments in GitHub: an empirical study Proceedings of the 11th working conference on mining software repositories, ACM, New York, NY, USA, MSR 2014, pp 352–355. doi:10.1145/2597073.2597118 – reference: NagappanNPotential of open source systems as project repositories for empirical studies working group results2007BerlinSpringer103107 – reference: GitHub Inc (2016b) Github archive. https://www.githubarchive.org/, accessed: 2016-06-19 – reference: Mockus A, Fielding R T, Herbsleb J (2000) A case study of open source software development: the apache server Proceedings of the 2000 international conference on software engineering. ICSE 2000 the new millennium, pp 263–272. doi:10.1145/337180.337209 – reference: Sajnani H, Saini V, Ossher J, Lopes CV (2014) Is popularity a measure of quality? an analysis of maven components 2014 IEEE international conference on software maintenance and evolution, pp 231–240. doi:10.1109/ICSME.2014.45 – reference: Software Freedom Law Center (2012) Managing copyright information within a free software project - software freedom law center. http://softwarefreedom.org/resources/2012/managingcopyrightinformation.html, accessed: 2015-05-15 – reference: BreimanLRandom forestsMach Learn200145153210.1023/A:10109334043241007.68152 – reference: EickSGGravesTLKarrAFMarronJSMockusADoes code decay? Assessing the evidence from change management dataIEEE Trans Softw Eng200127111210.1109/32.895984 – reference: HuBoard Inc (2016) Huboard - github issues made awesome. https://huboard.com/, accessed: 2016-03-11 – reference: Ray B, Posnett D, Filkov V, Devanbu P (2014) A large scale study of programming languages and code quality in github Proceedings of the 22nd ACM SIGSOFT international symposium on foundations of software engineering, ACM, New York, NY, USA, FSE 2014, pp 155–165. doi:10.1145/2635868.2635922 – reference: Vasilescu B, van Schuylenburg S, Wulms J, Serebrenik A, van den Brand MGJ (2014) Continuous integration in a social-coding world empirical evidence from GitHub 2014 IEEE international conference on software maintenance and evolution, pp 401–405. doi:10.1109/ICSME.2014.62 – reference: Codetree Studios (2016) Codetree - GitHub issues, managed. https://codetree.com/, accessed: 2016-03-11 – reference: Kofink A (2015) Contributions of the under-appreciated: gender bias in an open-source ecology Companion proceedings of the 2015 ACM SIGPLAN international conference on systems, programming, languages and applications: Software for humanity, ACM, New York, NY, USA, SPLASH Companion 2015, pp 83–84. doi:10.1145/2814189.2815369 – reference: Rosenberg J (1997) Some misconceptions about lines of code Proceedings fourth international software metrics symposium, pp 137–142. doi:10.1109/METRIC.1997.637174 – reference: Dyer R, Nguyen HA, Rajan H, Nguyen TN (2013) Boa: a language and infrastructure for analyzing ultra-large-scale software repositories Proceedings of the 2013 international conference on software engineering, IEEE Press, Piscataway, NJ, USA, ICSE ’13, pp 422–431. http://dl.acm.org/citation.cfm?id=2486788.2486844 – reference: ZhuHHallPAVMayJHRSoftware unit test coverage and adequacyACM Comput Surv199729436642710.1145/267580.267590 – reference: Bissyandé TF, Lo D, Jiang L, Réveillère L, Klein J, Traon YL (2013) Got issues? Who cares about it? A large scale investigation of issue trackers from GitHub 2013 IEEE 24th international symposium on software reliability engineering (ISSRE), pp 188–197. doi:10.1109/ISSRE.2013.6698918 – volume: 27 start-page: 1 issue: 1 year: 2001 ident: 9512_CR14 publication-title: IEEE Trans Softw Eng doi: 10.1109/32.895984 – ident: 9512_CR29 doi: 10.1201/9781420006742 – ident: 9512_CR39 – ident: 9512_CR30 doi: 10.1145/337180.337209 – volume: 29 start-page: 366 issue: 4 year: 1997 ident: 9512_CR47 publication-title: ACM Comput Surv doi: 10.1145/267580.267590 – ident: 9512_CR35 doi: 10.1145/2635868.2635922 – ident: 9512_CR8 – ident: 9512_CR38 doi: 10.1109/ICSME.2014.45 – ident: 9512_CR43 doi: 10.1109/ICSE.2015.245 – ident: 9512_CR28 doi: 10.1145/2814189.2815369 – ident: 9512_CR20 – ident: 9512_CR24 – ident: 9512_CR1 – ident: 9512_CR27 doi: 10.1109/CSMR.2013.48 – ident: 9512_CR22 doi: 10.1145/2597073.2597118 – volume: 45 start-page: 5 issue: 1 year: 2001 ident: 9512_CR7 publication-title: Mach Learn doi: 10.1023/A:1010933404324 – ident: 9512_CR32 – start-page: 103 volume-title: Potential of open source systems as project repositories for empirical studies working group results year: 2007 ident: 9512_CR33 doi: 10.1007/978-3-540-71301-2_29 – start-page: 1 volume-title: Collaborative software engineering: concepts and techniques year: 2010 ident: 9512_CR44 doi: 10.1007/978-3-642-10294-3_1 – ident: 9512_CR26 doi: 10.1145/2597073.2597074 – ident: 9512_CR17 – ident: 9512_CR19 – ident: 9512_CR40 – ident: 9512_CR13 – ident: 9512_CR41 doi: 10.1109/APNOMS.2014.6996525 – ident: 9512_CR9 – ident: 9512_CR11 – volume: 15 start-page: 225 issue: 3 year: 1976 ident: 9512_CR2 publication-title: IBM Syst J doi: 10.1147/sj.153.0225 – volume: 20 start-page: 38 issue: 2 year: 2003 ident: 9512_CR37 publication-title: J Eng Technol – ident: 9512_CR42 doi: 10.1109/ICSME.2014.62 – ident: 9512_CR5 doi: 10.1109/ICECCS.2013.42 – ident: 9512_CR21 – ident: 9512_CR6 doi: 10.1109/COMPSAC.2013.55 – ident: 9512_CR23 – ident: 9512_CR45 doi: 10.1109/ICST.2008.47 – start-page: 80 volume-title: Github projects. Quality analysis of open-source software year: 2014 ident: 9512_CR25 doi: 10.1007/978-3-319-13734-6_6 – ident: 9512_CR3 doi: 10.1145/2025113.2025119 – ident: 9512_CR46 – ident: 9512_CR4 doi: 10.1109/ISSRE.2013.6698918 – ident: 9512_CR36 doi: 10.1109/METRIC.1997.637174 – volume: 27 start-page: 630 issue: 7 year: 2001 ident: 9512_CR15 publication-title: IEEE Trans Softw Eng doi: 10.1109/32.935855 – ident: 9512_CR10 – ident: 9512_CR31 – ident: 9512_CR34 doi: 10.1109/ISSRE.2005.35 – ident: 9512_CR12 doi: 10.1145/1085313.1085331 – ident: 9512_CR16 – ident: 9512_CR18 |
| SSID | ssj0009745 |
| Score | 2.6014695 |
| Snippet | Software forges like GitHub host millions of repositories. Software engineering researchers have been able to take advantage of such a large corpora of... |
| SourceID | proquest crossref springer |
| SourceType | Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 3219 |
| SubjectTerms | Classification Classifiers Compilers Computer Science Ground truth Interpreters Noise Programming Languages Recall Repositories Researchers Software engineering Software Engineering/Programming and Operating Systems |
| Title | Curating GitHub for engineered software projects |
| URI | https://link.springer.com/article/10.1007/s10664-017-9512-6 https://www.proquest.com/docview/2063574530 |
| Volume | 22 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVLSH databaseName: SpringerLink Journals customDbUrl: mediaType: online eissn: 1573-7616 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0009745 issn: 1382-3256 databaseCode: AFBBN dateStart: 19970301 isFulltext: true providerName: Library Specific Holdings – providerCode: PRVAVX databaseName: SpringerLINK - Czech Republic Consortium customDbUrl: eissn: 1573-7616 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0009745 issn: 1382-3256 databaseCode: AGYKE dateStart: 19970101 isFulltext: true titleUrlDefault: http://link.springer.com providerName: Springer Nature – providerCode: PRVAVX databaseName: SpringerLink Journals (ICM) customDbUrl: eissn: 1573-7616 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0009745 issn: 1382-3256 databaseCode: U2A dateStart: 19970101 isFulltext: true titleUrlDefault: http://www.springerlink.com/journals/ providerName: Springer Nature |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LT8MwDLZgu3DhjRiMKQdOoEhZ82qP07QxgeDEpHGqkjZBSKigtRN_n6RLKSBA4tzUh8-J7cT2Z4BzajLJeCQxUUpjphOJlfUNzIoormxsdeS7kW_vxGzOrhd8Efq4y6bavUlJ1pb6U7ObEL5iwqcchxEWm9Dlns3LbeJ5NGqZdmU9mdhz62HqHHqTyvxJxFdn1EaY35Kita-Z7sJ2CBLRaK3VPdgwxT7sNAMYUDiPB0DGK6_A4hFdPVWzlUYuAkUmMAyaHJXOxr6ppUHhvaU8hPl0cj-e4TACAWcujqlwYoxzJ84eZUYprmOb5CLOh0TnQipjRcSMdZccTWMrpTHSJLnkSjGrqYgzYukRdIqXwhwDSjjPEsucBE_y51nmqdAx1Yy7S0LO8h6QBos0C_zgfkzFc9oyG3v4Ugdf6uFLRQ8uPn55XZNj_LW43wCchnNSphHxfHiMU9KDywb09vOvwk7-tfoUtiKv9LoKpQ-darkyZy6WqPQAuqOrh5vJoN5D7_MfwHM |
| linkProvider | Springer Nature |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LTwMhECZaD3rxbaxW5eBJQ0KX1-6xaayrtj21SW8EdsGYmNV0t_HvC1vWqlETz8AcPmBmYGa-AeCSmExQFgmEldKI6kQgZX0Bs8KKKRtbHflq5NGYp1N6P2OzUMddNtnuTUiy1tSfit049xkTPuTYjRBfBxuev8oT5k-j3oppV9SdiT23HiLOoDehzJ9EfDVGKw_zW1C0tjWDXbAdnETYW-7qHlgzxT7YaRowwHAfDwDuL_wGFo_w9qlKFxo6DxSawDBoclg6Hfum5gaG_5byEEwHN5N-ikILBJQ5P6ZCiTHOnDh9lBmlmI5tkvM472Kdc6GM5RE11j1yNImtEMYIk-SCKUWtJjzOsCVHoFW8FOYYwISxLLHUSfAkf55lnnAdE02ZeyTkNG8D3GAhs8AP7ttUPMsVs7GHTzr4pIdP8ja4-ljyuiTH-GtypwFYhntSygh7PjzKCG6D6wb01fCvwk7-NfsCbKaT0VAO78YPp2Ar8gegzkjpgFY1X5gz51dU-rw-R-8NwcHL |
| linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEA5aQbz4FqtVc_CkhKab1-6xVGt9FQ8WegvJJhFB1tLd4t832e5aFRU8J5nDN0lmkpn5BoBTYlNBWSQQVkojqhOBlAsFzAorplzsdBSqke-HfDCiN2M2rvqc5nW2ex2SnNc0BJamrGhPjGt_KnzjPGRPhPBjJ0J8GazQwJPgN_Qo6i5Yd0XZpTjw7CHijXsd1vxJxFfDtPA2vwVIS7vT3wTrlcMIu3MNb4Elm22DjboZA6zO5g7AvVlQZvYEr56LwUxD741CW7ENWgNzf9--qamF1d9LvgtG_cvH3gBV7RBQ6n2aAiXWetPi76bUKsV07BLDY9PB2nChrOMRtc4_eDSJnRDWCpsYwZSiThMep9iRPdDIXjO7D2DCWJo46iUEwr_AOE-4jommzD8YDDVNgGssZFpxhYeWFS9ywXIc4JMePhngk7wJzj6WTOZEGX9NbtUAy-rM5DLCgRuPMoKb4LwGfTH8q7CDf80-AasPF315dz28PQRrUdB_mZzSAo1iOrNH3sUo9HG5jd4BpCnGBw |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Curating+GitHub+for+engineered+software+projects&rft.jtitle=Empirical+software+engineering+%3A+an+international+journal&rft.au=Munaiah%2C+Nuthan&rft.au=Kroh%2C+Steven&rft.au=Cabrey%2C+Craig&rft.au=Nagappan%2C+Meiyappan&rft.date=2017-12-01&rft.issn=1382-3256&rft.eissn=1573-7616&rft.volume=22&rft.issue=6&rft.spage=3219&rft.epage=3253&rft_id=info:doi/10.1007%2Fs10664-017-9512-6&rft.externalDBID=n%2Fa&rft.externalDocID=10_1007_s10664_017_9512_6 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1382-3256&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1382-3256&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1382-3256&client=summon |