Curating GitHub for engineered software projects

Software forges like GitHub host millions of repositories. Software engineering researchers have been able to take advantage of such a large corpora of potential study subjects with the help of tools like GHTorrent and Boa. However, the simplicity in querying comes with a caveat: there are limited m...

Full description

Saved in:
Bibliographic Details
Published inEmpirical software engineering : an international journal Vol. 22; no. 6; pp. 3219 - 3253
Main Authors Munaiah, Nuthan, Kroh, Steven, Cabrey, Craig, Nagappan, Meiyappan
Format Journal Article
LanguageEnglish
Published New York Springer US 01.12.2017
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN1382-3256
1573-7616
DOI10.1007/s10664-017-9512-6

Cover

Abstract Software forges like GitHub host millions of repositories. Software engineering researchers have been able to take advantage of such a large corpora of potential study subjects with the help of tools like GHTorrent and Boa. However, the simplicity in querying comes with a caveat: there are limited means of separating the signal (e.g. repositories containing engineered software projects) from the noise (e.g. repositories containing home work assignments). The proportion of noise in a random sample of repositories could skew the study and may lead to researchers reaching unrealistic, potentially inaccurate, conclusions. We argue that it is imperative to have the ability to sieve out the noise in such large repository forges. We propose a framework, and present a reference implementation of the framework as a tool called reaper , to enable researchers to select GitHub repositories that contain evidence of an engineered software project. We identify software engineering practices (called dimensions) and propose means for validating their existence in a GitHub repository. We used reaper to measure the dimensions of 1,857,423 GitHub repositories. We then used manually classified data sets of repositories to train classifiers capable of predicting if a given GitHub repository contains an engineered software project. The performance of the classifiers was evaluated using a set of 200 repositories with known ground truth classification. We also compared the performance of the classifiers to other approaches to classification (e.g. number of GitHub Stargazers) and found our classifiers to outperform existing approaches. We found stargazers-based classifier (with 10 as the threshold for number of stargazers) to exhibit high precision (97%) but an inversely proportional recall (32%). On the other hand, our best classifier exhibited a high precision (82%) and a high recall (86%). The stargazer-based criteria offers precision but fails to recall a significant portion of the population.
AbstractList Software forges like GitHub host millions of repositories. Software engineering researchers have been able to take advantage of such a large corpora of potential study subjects with the help of tools like GHTorrent and Boa. However, the simplicity in querying comes with a caveat: there are limited means of separating the signal (e.g. repositories containing engineered software projects) from the noise (e.g. repositories containing home work assignments). The proportion of noise in a random sample of repositories could skew the study and may lead to researchers reaching unrealistic, potentially inaccurate, conclusions. We argue that it is imperative to have the ability to sieve out the noise in such large repository forges. We propose a framework, and present a reference implementation of the framework as a tool called reaper , to enable researchers to select GitHub repositories that contain evidence of an engineered software project. We identify software engineering practices (called dimensions) and propose means for validating their existence in a GitHub repository. We used reaper to measure the dimensions of 1,857,423 GitHub repositories. We then used manually classified data sets of repositories to train classifiers capable of predicting if a given GitHub repository contains an engineered software project. The performance of the classifiers was evaluated using a set of 200 repositories with known ground truth classification. We also compared the performance of the classifiers to other approaches to classification (e.g. number of GitHub Stargazers) and found our classifiers to outperform existing approaches. We found stargazers-based classifier (with 10 as the threshold for number of stargazers) to exhibit high precision (97%) but an inversely proportional recall (32%). On the other hand, our best classifier exhibited a high precision (82%) and a high recall (86%). The stargazer-based criteria offers precision but fails to recall a significant portion of the population.
Software forges like GitHub host millions of repositories. Software engineering researchers have been able to take advantage of such a large corpora of potential study subjects with the help of tools like GHTorrent and Boa. However, the simplicity in querying comes with a caveat: there are limited means of separating the signal (e.g. repositories containing engineered software projects) from the noise (e.g. repositories containing home work assignments). The proportion of noise in a random sample of repositories could skew the study and may lead to researchers reaching unrealistic, potentially inaccurate, conclusions. We argue that it is imperative to have the ability to sieve out the noise in such large repository forges. We propose a framework, and present a reference implementation of the framework as a tool called reaper, to enable researchers to select GitHub repositories that contain evidence of an engineered software project. We identify software engineering practices (called dimensions) and propose means for validating their existence in a GitHub repository. We used reaper to measure the dimensions of 1,857,423 GitHub repositories. We then used manually classified data sets of repositories to train classifiers capable of predicting if a given GitHub repository contains an engineered software project. The performance of the classifiers was evaluated using a set of 200 repositories with known ground truth classification. We also compared the performance of the classifiers to other approaches to classification (e.g. number of GitHub Stargazers) and found our classifiers to outperform existing approaches. We found stargazers-based classifier (with 10 as the threshold for number of stargazers) to exhibit high precision (97%) but an inversely proportional recall (32%). On the other hand, our best classifier exhibited a high precision (82%) and a high recall (86%). The stargazer-based criteria offers precision but fails to recall a significant portion of the population.
Author Kroh, Steven
Cabrey, Craig
Munaiah, Nuthan
Nagappan, Meiyappan
Author_xml – sequence: 1
  givenname: Nuthan
  orcidid: 0000-0003-2071-664X
  surname: Munaiah
  fullname: Munaiah, Nuthan
  email: nm6061@rit.edu
  organization: Department of Software Engineering, Rochester Institute of Technology
– sequence: 2
  givenname: Steven
  surname: Kroh
  fullname: Kroh, Steven
  organization: Department of Software Engineering, Rochester Institute of Technology
– sequence: 3
  givenname: Craig
  surname: Cabrey
  fullname: Cabrey, Craig
  organization: Department of Software Engineering, Rochester Institute of Technology
– sequence: 4
  givenname: Meiyappan
  surname: Nagappan
  fullname: Nagappan, Meiyappan
  organization: David R. Cheriton School of Computer Science, University of Waterloo
BookMark eNp9kEFLAzEQhYNUsK3-AG8LnqOTzSbZPUrRVih40XPI7k7KlprUJIv4701ZQRD0NHN437w3b0Fmzjsk5JrBLQNQd5GBlBUFpmgjWEnlGZkzoThVkslZ3nldUl4KeUEWMe4BoFGVmBNYjcGkwe2K9ZA2Y1tYHwp0u8EhBuyL6G36MAGLY_B77FK8JOfWHCJefc8leX18eFlt6PZ5_bS639Ku4iLRBhGkYox1aIxoa9v0su4ZtL1UBq0sK7TAqpbXVilEhU2vhDGVbbmsO7B8SW6mu9n4fcSY9N6PwWVLXYLkIqfnkFVqUnXBxxjQ6m5I-R_vUjDDQTPQp3r0VI_O9ehTPVpmkv0ij2F4M-HzX6acmJi1bofhJ9Pf0BcCq3kV
CitedBy_id crossref_primary_10_1016_j_infsof_2024_107425
crossref_primary_10_1016_j_infsof_2021_106699
crossref_primary_10_7232_JKIIE_2024_50_2_109
crossref_primary_10_1016_j_eswa_2020_114176
crossref_primary_10_1007_s10664_020_09905_9
crossref_primary_10_1007_s10664_024_10568_z
crossref_primary_10_1109_TSE_2023_3244123
crossref_primary_10_1109_ACCESS_2023_3244495
crossref_primary_10_1109_TSE_2022_3224053
crossref_primary_10_1016_j_infsof_2023_107317
crossref_primary_10_1109_TSE_2019_2960357
crossref_primary_10_1016_j_jss_2020_110821
crossref_primary_10_1145_3428236
crossref_primary_10_1007_s10664_021_10058_6
crossref_primary_10_1109_TSE_2021_3063220
crossref_primary_10_3390_fi10030029
crossref_primary_10_1089_hs_2020_0194
crossref_primary_10_1007_s10664_020_09825_8
crossref_primary_10_1109_TSE_2019_2918536
crossref_primary_10_1109_TSE_2021_3051492
crossref_primary_10_1145_3555208
crossref_primary_10_1016_j_jss_2023_111806
crossref_primary_10_1007_s10664_018_9650_5
crossref_primary_10_1142_S0218194023500110
crossref_primary_10_1109_TSE_2021_3073773
crossref_primary_10_1007_s10515_019_00254_6
crossref_primary_10_3390_a14100289
crossref_primary_10_1007_s10664_023_10421_9
crossref_primary_10_1145_3467895
crossref_primary_10_1007_s10664_021_10033_1
crossref_primary_10_1007_s11432_022_3953_4
crossref_primary_10_1109_TSE_2023_3265962
crossref_primary_10_1109_MS_2020_3041241
crossref_primary_10_7717_peerj_cs_372
crossref_primary_10_1109_TSE_2023_3278129
crossref_primary_10_1007_s10664_023_10313_y
crossref_primary_10_1016_j_infsof_2024_107450
crossref_primary_10_1007_s10664_023_10418_4
crossref_primary_10_1007_s10664_019_09796_5
crossref_primary_10_3390_app11167250
crossref_primary_10_1016_j_entcom_2024_100832
crossref_primary_10_1109_ACCESS_2019_2933318
crossref_primary_10_1016_j_jss_2018_02_041
crossref_primary_10_1007_s10664_023_10305_y
crossref_primary_10_1007_s10664_020_09841_8
crossref_primary_10_1016_j_jss_2025_112438
crossref_primary_10_1109_TSE_2022_3143766
crossref_primary_10_1145_3503509
crossref_primary_10_1007_s10664_021_10068_4
crossref_primary_10_1145_3660823
crossref_primary_10_1016_j_jss_2021_110936
crossref_primary_10_1007_s10664_021_09959_3
crossref_primary_10_1145_3680463
crossref_primary_10_1145_3583565
crossref_primary_10_1145_3241743
crossref_primary_10_1007_s10664_018_9659_9
crossref_primary_10_1007_s10664_022_10171_0
crossref_primary_10_1007_s10664_021_10078_2
crossref_primary_10_1002_smr_2395
crossref_primary_10_1142_S0218194022500723
crossref_primary_10_1007_s10796_020_10100_w
crossref_primary_10_1145_3408897
crossref_primary_10_1007_s10515_021_00314_w
crossref_primary_10_1007_s10515_024_00469_2
crossref_primary_10_1109_TSE_2020_2986415
crossref_primary_10_1080_07421222_2019_1705514
crossref_primary_10_1109_MS_2020_3036758
crossref_primary_10_1007_s10664_024_10606_w
crossref_primary_10_1016_j_jss_2020_110863
crossref_primary_10_1007_s10664_023_10373_0
crossref_primary_10_1109_ACCESS_2021_3116061
crossref_primary_10_1007_s10664_024_10482_4
crossref_primary_10_1007_s40747_022_00848_w
crossref_primary_10_1109_TSE_2021_3087419
crossref_primary_10_1109_TASE_2020_3035620
crossref_primary_10_1145_3579639
crossref_primary_10_1007_s10664_023_10328_5
crossref_primary_10_1016_j_infsof_2021_106791
crossref_primary_10_1007_s10664_023_10382_z
crossref_primary_10_1007_s11334_021_00422_6
crossref_primary_10_1007_s10515_022_00364_8
crossref_primary_10_1145_3688841
crossref_primary_10_1016_j_infsof_2024_107404
crossref_primary_10_1109_TSE_2020_2978819
crossref_primary_10_1007_s10664_024_10462_8
crossref_primary_10_1109_TSE_2021_3088759
crossref_primary_10_7717_peerj_cs_2429
crossref_primary_10_1016_j_jss_2024_112101
crossref_primary_10_1007_s10664_022_10261_z
crossref_primary_10_1007_s10664_024_10523_y
crossref_primary_10_1016_j_jss_2022_111458
crossref_primary_10_1007_s13748_018_0147_7
crossref_primary_10_1007_s10664_024_10584_z
crossref_primary_10_1109_TSE_2024_3408448
crossref_primary_10_1016_j_jss_2020_110704
crossref_primary_10_1016_j_jss_2018_09_016
crossref_primary_10_1007_s10664_021_09961_9
crossref_primary_10_1007_s10664_020_09926_4
crossref_primary_10_1007_s10664_018_9641_6
crossref_primary_10_1007_s10664_025_10626_0
crossref_primary_10_1145_3433928
crossref_primary_10_1016_j_procs_2019_12_042
crossref_primary_10_1007_s10664_020_09847_2
crossref_primary_10_1109_ACCESS_2023_3314572
crossref_primary_10_1002_smr_2441
crossref_primary_10_1007_s10664_023_10368_x
crossref_primary_10_1007_s11219_021_09564_z
crossref_primary_10_1016_j_respol_2024_105008
Cites_doi 10.1109/32.895984
10.1201/9781420006742
10.1145/337180.337209
10.1145/267580.267590
10.1145/2635868.2635922
10.1109/ICSME.2014.45
10.1109/ICSE.2015.245
10.1145/2814189.2815369
10.1109/CSMR.2013.48
10.1145/2597073.2597118
10.1023/A:1010933404324
10.1007/978-3-540-71301-2_29
10.1007/978-3-642-10294-3_1
10.1145/2597073.2597074
10.1109/APNOMS.2014.6996525
10.1147/sj.153.0225
10.1109/ICSME.2014.62
10.1109/ICECCS.2013.42
10.1109/COMPSAC.2013.55
10.1109/ICST.2008.47
10.1007/978-3-319-13734-6_6
10.1145/2025113.2025119
10.1109/ISSRE.2013.6698918
10.1109/METRIC.1997.637174
10.1109/32.935855
10.1109/ISSRE.2005.35
10.1145/1085313.1085331
ContentType Journal Article
Copyright Springer Science+Business Media New York 2017
Copyright Springer Science & Business Media 2017
Copyright_xml – notice: Springer Science+Business Media New York 2017
– notice: Copyright Springer Science & Business Media 2017
DBID AAYXX
CITATION
7SC
8FD
JQ2
L7M
L~C
L~D
DOI 10.1007/s10664-017-9512-6
DatabaseName CrossRef
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
DatabaseTitleList
Computer and Information Systems Abstracts
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1573-7616
EndPage 3253
ExternalDocumentID 10_1007_s10664_017_9512_6
GroupedDBID -4Z
-59
-5G
-BR
-EM
-Y2
-~C
.86
.DC
.VR
06D
0R~
0VY
199
1N0
1SB
2.D
203
28-
29G
2J2
2JN
2JY
2KG
2LR
2P1
2VQ
2~H
30V
4.4
406
408
409
40D
40E
5GY
5QI
5VS
67Z
6NX
78A
8FE
8FG
8TC
8UJ
95-
95.
95~
96X
AABHQ
AACDK
AAHNG
AAIAL
AAJBT
AAJKR
AANZL
AAOBN
AARHV
AARTL
AASML
AATNV
AATVU
AAUYE
AAWCG
AAYIU
AAYOK
AAYQN
AAYTO
AAYZH
ABAKF
ABBBX
ABBXA
ABDZT
ABECU
ABFTD
ABFTV
ABHLI
ABHQN
ABJCF
ABJNI
ABJOX
ABKCH
ABKTR
ABMNI
ABMQK
ABNWP
ABQBU
ABQSL
ABSXP
ABTEG
ABTHY
ABTKH
ABTMW
ABULA
ABWNU
ABXPI
ACAOD
ACBXY
ACDTI
ACGFS
ACHSB
ACHXU
ACIWK
ACKNC
ACMDZ
ACMLO
ACOKC
ACOMO
ACPIV
ACSNA
ACZOJ
ADHHG
ADHIR
ADIMF
ADINQ
ADKNI
ADKPE
ADRFC
ADTPH
ADURQ
ADYFF
ADZKW
AEBTG
AEFIE
AEFQL
AEGAL
AEGNC
AEJHL
AEJRE
AEKMD
AEMSY
AENEX
AEOHA
AEPYU
AESKC
AETLH
AEVLU
AEXYK
AFBBN
AFEXP
AFGCZ
AFKRA
AFLOW
AFQWF
AFWTZ
AFZKB
AGAYW
AGDGC
AGGDS
AGJBK
AGMZJ
AGQEE
AGQMX
AGRTI
AGWIL
AGWZB
AGYKE
AHAVH
AHBYD
AHKAY
AHSBF
AHYZX
AIAKS
AIGIU
AIIXL
AILAN
AITGF
AJBLW
AJRNO
AJZVZ
ALMA_UNASSIGNED_HOLDINGS
ALWAN
AMKLP
AMXSW
AMYLF
AMYQR
AOCGG
ARAPS
ARMRJ
ASPBG
AVWKF
AXYYD
AYJHY
AZFZN
B-.
BA0
BBWZM
BDATZ
BENPR
BGLVJ
BGNMA
BSONS
CAG
CCPQU
COF
CS3
CSCUP
DDRTE
DL5
DNIVK
DPUIP
DU5
EBLON
EBS
EIOEI
EJD
ESBYG
FEDTE
FERAY
FFXSO
FIGPU
FINBP
FNLPD
FRRFC
FSGXE
FWDCC
GGCAI
GGRSB
GJIRD
GNWQR
GQ6
GQ7
GQ8
GXS
H13
HCIFZ
HF~
HG5
HG6
HMJXF
HQYDN
HRMNR
HVGLF
HZ~
I09
IHE
IJ-
IKXTQ
ITM
IWAJR
IXC
IZIGR
IZQ
I~X
I~Z
J-C
J0Z
JBSCW
JCJTX
JZLTJ
KDC
KOV
KOW
L6V
LAK
LLZTM
M4Y
M7S
MA-
N2Q
NB0
NDZJH
NPVJJ
NQJWS
NU0
O9-
O93
O9G
O9I
O9J
OAM
P19
P62
P9O
PF0
PT4
PT5
PTHSS
Q2X
QOK
QOS
R4E
R89
R9I
RHV
RNI
RNS
ROL
RPX
RSV
RZC
RZE
RZK
S0W
S16
S1Z
S26
S27
S28
S3B
SAP
SCJ
SCLPG
SCO
SDH
SDM
SHX
SISQX
SJYHP
SNE
SNPRN
SNX
SOHCF
SOJ
SPISZ
SRMVM
SSLCW
STPWE
SZN
T13
T16
TSG
TSK
TSV
TUC
U2A
UG4
UOJIU
UTJUX
UZXMN
VC2
VFIZW
W23
W48
WK8
YLTOR
Z45
Z7R
Z7S
Z7V
Z7X
Z7Z
Z81
Z83
Z86
Z88
Z8M
Z8N
Z8P
Z8R
Z8T
Z8U
Z8W
Z92
ZMTXR
~EX
AAPKM
AAYXX
ABBRH
ABDBE
ABFSG
ABRTQ
ACSTC
ADHKG
AEZWR
AFDZB
AFHIU
AFOHR
AGQPQ
AHPBZ
AHWEU
AIXLP
ATHPR
AYFIA
CITATION
PHGZM
PHGZT
PQGLB
PUEGO
7SC
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-c435t-9ee067111ceaa5b8f9d68d10bd67aef624ef014b38f77ee7e9d75aa4fb368c0f3
IEDL.DBID U2A
ISSN 1382-3256
IngestDate Fri Jul 25 12:26:23 EDT 2025
Wed Oct 01 04:12:35 EDT 2025
Thu Apr 24 23:10:48 EDT 2025
Fri Feb 21 02:35:47 EST 2025
IsPeerReviewed true
IsScholarly true
Issue 6
Keywords Data curation
Curation tools
GitHub
Mining software repositories
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c435t-9ee067111ceaa5b8f9d68d10bd67aef624ef014b38f77ee7e9d75aa4fb368c0f3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ORCID 0000-0003-2071-664X
PQID 2063574530
PQPubID 326341
PageCount 35
ParticipantIDs proquest_journals_2063574530
crossref_citationtrail_10_1007_s10664_017_9512_6
crossref_primary_10_1007_s10664_017_9512_6
springer_journals_10_1007_s10664_017_9512_6
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2017-12-01
PublicationDateYYYYMMDD 2017-12-01
PublicationDate_xml – month: 12
  year: 2017
  text: 2017-12-01
  day: 01
PublicationDecade 2010
PublicationPlace New York
PublicationPlace_xml – name: New York
– name: Dordrecht
PublicationSubtitle An International Journal
PublicationTitle Empirical software engineering : an international journal
PublicationTitleAbbrev Empir Software Eng
PublicationYear 2017
Publisher Springer US
Springer Nature B.V
Publisher_xml – name: Springer US
– name: Springer Nature B.V
References Munaiah N, Kroh S, Cabrey C, Nagappan M (2016a) Home of the reporeapers. https://reporeapers.github.io, accessed: 2016-03-11
BreimanLRandom forestsMach Learn200145153210.1023/A:10109334043241007.68152
Dyer R, Nguyen HA, Rajan H, Nguyen TN (2013) Boa: a language and infrastructure for analyzing ultra-large-scale software repositories Proceedings of the 2013 international conference on software engineering, IEEE Press, Piscataway, NJ, USA, ICSE ’13, pp 422–431. http://dl.acm.org/citation.cfm?id=2486788.2486844
Gousios G (2013) The GHTorrent dataset and tool suite Proceedings of the 10th working conference on mining software repositories, IEEE Press, Piscataway, NJ, USA, MSR ’13, pp 233–236. http://dl.acm.org/citation.cfm?id=2487085.2487132
Software Freedom Law Center (2012) Managing copyright information within a free software project - software freedom law center. http://softwarefreedom.org/resources/2012/managingcopyrightinformation.html, accessed: 2015-05-15
GitHub Inc (2016b) Github archive. https://www.githubarchive.org/, accessed: 2016-06-19
GHTorrent (2016b) The relational DB schema. http://ghtorrent.org/relational.html, accessed: 2016-03-11
Carlo Z (2016) Githut - programming languages and GitHub. http://githut.info, accessed: 2016-03-11
GitHub Inc (2016c) No license - choose a license. http://choosealicense.com/no-license/, accessed: 2016-03-11
Tung YH, Chuang CJ, Shan HL (2014) A framework of code reuse in open source software The 16th asia-pacific network operations and management symposium, pp 1–6. doi:10.1109/APNOMS.2014.6996525
JarczykOGruszkaBJaroszewiczSBukowskiLWierzbickiAGithub projects. Quality analysis of open-source software2014ChamSpringer International Publishing8094
Allamanis M, Sutton C (2013) Mining source code repositories at massive scale using language modeling Proceedings of the 10th working conference on mining software repositories, IEEE Press, Piscataway, NJ, USA, MSR ’13, pp 207–216. http://dl.acm.org/citation.cfm?id=2487085.2487127
GitHub Inc (2016a) Github API v3—github developer guide. https://developer.github.com/v3/, accessed: 2016-03-11
Mockus A, Fielding R T, Herbsleb J (2000) A case study of open source software development: the apache server Proceedings of the 2000 international conference on software engineering. ICSE 2000 the new millennium, pp 263–272. doi:10.1145/337180.337209
Codetree Studios (2016) Codetree - GitHub issues, managed. https://codetree.com/, accessed: 2016-03-11
Nagappan N, Williams L, Osborne J, Vouk M, Abrahamsson P (2005) Providing test quality feedback using static source code and automatic test suite metrics 16th IEEE international symposium on software reliability engineering (ISSRE’05), pp 10–94. doi:10.1109/ISSRE.2005.35
Bird C, Nagappan N, Murphy B, Gall H, Devanbu P (2011) Don’t touch my code!: examining the effects of ownership on software quality Proceedings of the 19th ACM SIGSOFT symposium and the 13th european conference on foundations of software engineering, ACM, New York, NY, USA, ESEC/FSE ’11, pp 4–14. doi:10.1145/2025113.2025119
Bissyandé TF, Lo D, Jiang L, Réveillère L, Klein J, Traon YL (2013) Got issues? Who cares about it? A large scale investigation of issue trackers from GitHub 2013 IEEE 24th international symposium on software reliability engineering (ISSRE), pp 188–197. doi:10.1109/ISSRE.2013.6698918
Zaidman A, Rompaey BV, Demeyer S, v Deursen A (2008) Mining software repositories to study co-evolution of production & test code 2008 1st international conference on software testing, verification, and validation, pp 220–229. doi:10.1109/ICST.2008.47
EmamKEBenlarbiSGoelNRaiSNThe confounding effect of class size on the validity of object-oriented metricsIEEE Trans Softw Eng200127763065010.1109/32.935855
de Souza CB, Anquetil N, de Oliveira KM (2005) A study of the documentation essential to software maintenance Proceedings of the 23rd annual international conference on design of communication: documenting & designing for pervasive information, ACM, New York, NY, USA, SIGDOC ’05, pp 68–75. doi:10.1145/1085313.1085331
Iowa State University (2016) Publications related to Boa - Boa - Iowa State University. http://boa.cs.iastate.edu/papers/, accessed: 2016-03-11
Syer MD, Nagappan M, Hassan AE, Adams B (2013) Revisiting prior empirical findings for mobile apps: an empirical case study on the 15 most popular open-source android apps Proceedings of the 2013 conference of the center for advanced studies on collaborative research, IBM corp., riverton, NJ, USA, CASCON ’13, pp 283–297. http://dl.acm.org/citation.cfm?id=2555523.2555553
Kalliamvakou E, Gousios G, Blincoe K, Singer L, German DM, Damian D (2014) The promises and perils of mining GitHub Proceedings of the 11th working conference on mining software repositories, ACM, New York, NY, USA, MSR 2014, pp 92–101. doi:10.1145/2597073.2597074
WhiteheadJMistríkIGrundyJvan der HoekACollaborative software engineering: concepts and techniques2010BerlinSpringer13010.1007/978-3-642-10294-3_11183.68199
Bissyandé TF, Thung F, Lo D, Jiang L, Réveillère L (2013a) Orion: a software project search engine with integrated diverse software artifacts 2013 18th international conference on engineering of complex computer systems, pp 242–245. doi:10.1109/ICECCS.2013.42
Ray B, Posnett D, Filkov V, Devanbu P (2014) A large scale study of programming languages and code quality in github Proceedings of the 22nd ACM SIGSOFT international symposium on foundations of software engineering, ACM, New York, NY, USA, FSE 2014, pp 155–165. doi:10.1145/2635868.2635922
Rosenberg J (1997) Some misconceptions about lines of code Proceedings fourth international software metrics symposium, pp 137–142. doi:10.1109/METRIC.1997.637174
Danial A (2014) CLOC – Count lines of code. http://cloc.sourceforge.net/, accessed: 2016-03-11, version: 1.62
Guzman E, Azócar D, Li Y (2014) Sentiment analysis of commit comments in GitHub: an empirical study Proceedings of the 11th working conference on mining software repositories, ACM, New York, NY, USA, MSR 2014, pp 352–355. doi:10.1145/2597073.2597118
Kofink A (2015) Contributions of the under-appreciated: gender bias in an open-source ecology Companion proceedings of the 2015 ACM SIGPLAN international conference on systems, programming, languages and applications: Software for humanity, ACM, New York, NY, USA, SPLASH Companion 2015, pp 83–84. doi:10.1145/2814189.2815369
Munaiah N, Kroh S, Cabrey C, Parikh N (2016b) Reaper - reference implementation. https://github.com/reporeapers/reaper, accessed: 2016-03-11
GHTorrent (2016a) Hall of fame. http://ghtorrent.org/halloffame.html, accessed: 2016-03-11
HuBoard Inc (2016) Huboard - github issues made awesome. https://huboard.com/, accessed: 2016-03-11
Laplante P (2007) What every engineer should know about software engineering. What every engineer should know. CRC Press
Vendome C (2015) A large scale study of license usage on GitHub 2015 IEEE/ACM 37th IEEE international conference on software engineering, vol 2, pp 772–774. doi:10.1109/ICSE.2015.245
Zenhub (2016) Zenhub - project management for agile teams on GitHub. https://www.zenhub.io/, accessed: 2016-03-11
Bissyandé TF, Thung F, Lo D, Jiang L, Réveillère L (2013b) Popularity, interoperability, and impact of programming languages in 100,000 open source projects 2013 IEEE 37th annual computer software and applications conference, pp 303–312. doi:10.1109/COMPSAC.2013.55
CA Technologies (2016) Waffle.io - work better on GitHub issues. https://waffle.io/, accessed: 2016-03-11
Kochhar PS, Bissyandé TF, Lo D, Jiang L (2013) Adoption of software testing in open source projects–a preliminary study on 50,000 projects 2013 17th european conference on software maintenance and reengineering, pp 353–356. doi:10.1109/CSMR.2013.48
NagappanNPotential of open source systems as project repositories for empirical studies working group results2007BerlinSpringer103107
Sajnani H, Saini V, Ossher J, Lopes CV (2014) Is popularity a measure of quality? an analysis of maven components 2014 IEEE international conference on software maintenance and evolution, pp 231–240. doi:10.1109/ICSME.2014.45
Vasilescu B, van Schuylenburg S, Wulms J, Serebrenik A, van den Brand MGJ (2014) Continuous integration in a social-coding world empirical evidence from GitHub 2014 IEEE international conference on software maintenance and evolution, pp 401–405. doi:10.1109/ICSME.2014.62
BeladyLALehmanMMA model of large program developmentIBM Syst J197615322525210.1147/sj.153.02250329.68014
RossSMPeirce’s criterion for the elimination of suspect experimental dataJ Eng Technol20032023841
ZhuHHallPAVMayJHRSoftware unit test coverage and adequacyACM Comput Surv199729436642710.1145/267580.267590
EickSGGravesTLKarrAFMarronJSMockusADoes code decay? Assessing the evidence from change management dataIEEE Trans Softw Eng200127111210.1109/32.895984
9512_CR19
9512_CR18
9512_CR11
9512_CR10
9512_CR32
9512_CR13
9512_CR35
9512_CR12
9512_CR34
9512_CR36
9512_CR17
9512_CR39
9512_CR16
9512_CR38
9512_CR31
9512_CR30
N Nagappan (9512_CR33) 2007
SM Ross (9512_CR37) 2003; 20
KE Emam (9512_CR15) 2001; 27
9512_CR29
LA Belady (9512_CR2) 1976; 15
H Zhu (9512_CR47) 1997; 29
9512_CR22
O Jarczyk (9512_CR25) 2014
9512_CR21
9512_CR43
9512_CR24
9512_CR46
9512_CR23
9512_CR45
9512_CR26
L Breiman (9512_CR7) 2001; 45
9512_CR28
9512_CR27
9512_CR8
9512_CR6
SG Eick (9512_CR14) 2001; 27
J Whitehead (9512_CR44) 2010
9512_CR5
9512_CR4
9512_CR40
9512_CR3
9512_CR20
9512_CR42
9512_CR1
9512_CR41
9512_CR9
References_xml – reference: GitHub Inc (2016c) No license - choose a license. http://choosealicense.com/no-license/, accessed: 2016-03-11
– reference: Gousios G (2013) The GHTorrent dataset and tool suite Proceedings of the 10th working conference on mining software repositories, IEEE Press, Piscataway, NJ, USA, MSR ’13, pp 233–236. http://dl.acm.org/citation.cfm?id=2487085.2487132
– reference: JarczykOGruszkaBJaroszewiczSBukowskiLWierzbickiAGithub projects. Quality analysis of open-source software2014ChamSpringer International Publishing8094
– reference: Munaiah N, Kroh S, Cabrey C, Parikh N (2016b) Reaper - reference implementation. https://github.com/reporeapers/reaper, accessed: 2016-03-11
– reference: GitHub Inc (2016a) Github API v3—github developer guide. https://developer.github.com/v3/, accessed: 2016-03-11
– reference: WhiteheadJMistríkIGrundyJvan der HoekACollaborative software engineering: concepts and techniques2010BerlinSpringer13010.1007/978-3-642-10294-3_11183.68199
– reference: Iowa State University (2016) Publications related to Boa - Boa - Iowa State University. http://boa.cs.iastate.edu/papers/, accessed: 2016-03-11
– reference: CA Technologies (2016) Waffle.io - work better on GitHub issues. https://waffle.io/, accessed: 2016-03-11
– reference: GHTorrent (2016a) Hall of fame. http://ghtorrent.org/halloffame.html, accessed: 2016-03-11
– reference: Vendome C (2015) A large scale study of license usage on GitHub 2015 IEEE/ACM 37th IEEE international conference on software engineering, vol 2, pp 772–774. doi:10.1109/ICSE.2015.245
– reference: Laplante P (2007) What every engineer should know about software engineering. What every engineer should know. CRC Press
– reference: Zenhub (2016) Zenhub - project management for agile teams on GitHub. https://www.zenhub.io/, accessed: 2016-03-11
– reference: Munaiah N, Kroh S, Cabrey C, Nagappan M (2016a) Home of the reporeapers. https://reporeapers.github.io, accessed: 2016-03-11
– reference: Nagappan N, Williams L, Osborne J, Vouk M, Abrahamsson P (2005) Providing test quality feedback using static source code and automatic test suite metrics 16th IEEE international symposium on software reliability engineering (ISSRE’05), pp 10–94. doi:10.1109/ISSRE.2005.35
– reference: Syer MD, Nagappan M, Hassan AE, Adams B (2013) Revisiting prior empirical findings for mobile apps: an empirical case study on the 15 most popular open-source android apps Proceedings of the 2013 conference of the center for advanced studies on collaborative research, IBM corp., riverton, NJ, USA, CASCON ’13, pp 283–297. http://dl.acm.org/citation.cfm?id=2555523.2555553
– reference: de Souza CB, Anquetil N, de Oliveira KM (2005) A study of the documentation essential to software maintenance Proceedings of the 23rd annual international conference on design of communication: documenting & designing for pervasive information, ACM, New York, NY, USA, SIGDOC ’05, pp 68–75. doi:10.1145/1085313.1085331
– reference: GHTorrent (2016b) The relational DB schema. http://ghtorrent.org/relational.html, accessed: 2016-03-11
– reference: RossSMPeirce’s criterion for the elimination of suspect experimental dataJ Eng Technol20032023841
– reference: Tung YH, Chuang CJ, Shan HL (2014) A framework of code reuse in open source software The 16th asia-pacific network operations and management symposium, pp 1–6. doi:10.1109/APNOMS.2014.6996525
– reference: Carlo Z (2016) Githut - programming languages and GitHub. http://githut.info, accessed: 2016-03-11
– reference: Bird C, Nagappan N, Murphy B, Gall H, Devanbu P (2011) Don’t touch my code!: examining the effects of ownership on software quality Proceedings of the 19th ACM SIGSOFT symposium and the 13th european conference on foundations of software engineering, ACM, New York, NY, USA, ESEC/FSE ’11, pp 4–14. doi:10.1145/2025113.2025119
– reference: Danial A (2014) CLOC – Count lines of code. http://cloc.sourceforge.net/, accessed: 2016-03-11, version: 1.62
– reference: Zaidman A, Rompaey BV, Demeyer S, v Deursen A (2008) Mining software repositories to study co-evolution of production & test code 2008 1st international conference on software testing, verification, and validation, pp 220–229. doi:10.1109/ICST.2008.47
– reference: Bissyandé TF, Thung F, Lo D, Jiang L, Réveillère L (2013b) Popularity, interoperability, and impact of programming languages in 100,000 open source projects 2013 IEEE 37th annual computer software and applications conference, pp 303–312. doi:10.1109/COMPSAC.2013.55
– reference: Kalliamvakou E, Gousios G, Blincoe K, Singer L, German DM, Damian D (2014) The promises and perils of mining GitHub Proceedings of the 11th working conference on mining software repositories, ACM, New York, NY, USA, MSR 2014, pp 92–101. doi:10.1145/2597073.2597074
– reference: Bissyandé TF, Thung F, Lo D, Jiang L, Réveillère L (2013a) Orion: a software project search engine with integrated diverse software artifacts 2013 18th international conference on engineering of complex computer systems, pp 242–245. doi:10.1109/ICECCS.2013.42
– reference: EmamKEBenlarbiSGoelNRaiSNThe confounding effect of class size on the validity of object-oriented metricsIEEE Trans Softw Eng200127763065010.1109/32.935855
– reference: Kochhar PS, Bissyandé TF, Lo D, Jiang L (2013) Adoption of software testing in open source projects–a preliminary study on 50,000 projects 2013 17th european conference on software maintenance and reengineering, pp 353–356. doi:10.1109/CSMR.2013.48
– reference: BeladyLALehmanMMA model of large program developmentIBM Syst J197615322525210.1147/sj.153.02250329.68014
– reference: Allamanis M, Sutton C (2013) Mining source code repositories at massive scale using language modeling Proceedings of the 10th working conference on mining software repositories, IEEE Press, Piscataway, NJ, USA, MSR ’13, pp 207–216. http://dl.acm.org/citation.cfm?id=2487085.2487127
– reference: Guzman E, Azócar D, Li Y (2014) Sentiment analysis of commit comments in GitHub: an empirical study Proceedings of the 11th working conference on mining software repositories, ACM, New York, NY, USA, MSR 2014, pp 352–355. doi:10.1145/2597073.2597118
– reference: NagappanNPotential of open source systems as project repositories for empirical studies working group results2007BerlinSpringer103107
– reference: GitHub Inc (2016b) Github archive. https://www.githubarchive.org/, accessed: 2016-06-19
– reference: Mockus A, Fielding R T, Herbsleb J (2000) A case study of open source software development: the apache server Proceedings of the 2000 international conference on software engineering. ICSE 2000 the new millennium, pp 263–272. doi:10.1145/337180.337209
– reference: Sajnani H, Saini V, Ossher J, Lopes CV (2014) Is popularity a measure of quality? an analysis of maven components 2014 IEEE international conference on software maintenance and evolution, pp 231–240. doi:10.1109/ICSME.2014.45
– reference: Software Freedom Law Center (2012) Managing copyright information within a free software project - software freedom law center. http://softwarefreedom.org/resources/2012/managingcopyrightinformation.html, accessed: 2015-05-15
– reference: BreimanLRandom forestsMach Learn200145153210.1023/A:10109334043241007.68152
– reference: EickSGGravesTLKarrAFMarronJSMockusADoes code decay? Assessing the evidence from change management dataIEEE Trans Softw Eng200127111210.1109/32.895984
– reference: HuBoard Inc (2016) Huboard - github issues made awesome. https://huboard.com/, accessed: 2016-03-11
– reference: Ray B, Posnett D, Filkov V, Devanbu P (2014) A large scale study of programming languages and code quality in github Proceedings of the 22nd ACM SIGSOFT international symposium on foundations of software engineering, ACM, New York, NY, USA, FSE 2014, pp 155–165. doi:10.1145/2635868.2635922
– reference: Vasilescu B, van Schuylenburg S, Wulms J, Serebrenik A, van den Brand MGJ (2014) Continuous integration in a social-coding world empirical evidence from GitHub 2014 IEEE international conference on software maintenance and evolution, pp 401–405. doi:10.1109/ICSME.2014.62
– reference: Codetree Studios (2016) Codetree - GitHub issues, managed. https://codetree.com/, accessed: 2016-03-11
– reference: Kofink A (2015) Contributions of the under-appreciated: gender bias in an open-source ecology Companion proceedings of the 2015 ACM SIGPLAN international conference on systems, programming, languages and applications: Software for humanity, ACM, New York, NY, USA, SPLASH Companion 2015, pp 83–84. doi:10.1145/2814189.2815369
– reference: Rosenberg J (1997) Some misconceptions about lines of code Proceedings fourth international software metrics symposium, pp 137–142. doi:10.1109/METRIC.1997.637174
– reference: Dyer R, Nguyen HA, Rajan H, Nguyen TN (2013) Boa: a language and infrastructure for analyzing ultra-large-scale software repositories Proceedings of the 2013 international conference on software engineering, IEEE Press, Piscataway, NJ, USA, ICSE ’13, pp 422–431. http://dl.acm.org/citation.cfm?id=2486788.2486844
– reference: ZhuHHallPAVMayJHRSoftware unit test coverage and adequacyACM Comput Surv199729436642710.1145/267580.267590
– reference: Bissyandé TF, Lo D, Jiang L, Réveillère L, Klein J, Traon YL (2013) Got issues? Who cares about it? A large scale investigation of issue trackers from GitHub 2013 IEEE 24th international symposium on software reliability engineering (ISSRE), pp 188–197. doi:10.1109/ISSRE.2013.6698918
– volume: 27
  start-page: 1
  issue: 1
  year: 2001
  ident: 9512_CR14
  publication-title: IEEE Trans Softw Eng
  doi: 10.1109/32.895984
– ident: 9512_CR29
  doi: 10.1201/9781420006742
– ident: 9512_CR39
– ident: 9512_CR30
  doi: 10.1145/337180.337209
– volume: 29
  start-page: 366
  issue: 4
  year: 1997
  ident: 9512_CR47
  publication-title: ACM Comput Surv
  doi: 10.1145/267580.267590
– ident: 9512_CR35
  doi: 10.1145/2635868.2635922
– ident: 9512_CR8
– ident: 9512_CR38
  doi: 10.1109/ICSME.2014.45
– ident: 9512_CR43
  doi: 10.1109/ICSE.2015.245
– ident: 9512_CR28
  doi: 10.1145/2814189.2815369
– ident: 9512_CR20
– ident: 9512_CR24
– ident: 9512_CR1
– ident: 9512_CR27
  doi: 10.1109/CSMR.2013.48
– ident: 9512_CR22
  doi: 10.1145/2597073.2597118
– volume: 45
  start-page: 5
  issue: 1
  year: 2001
  ident: 9512_CR7
  publication-title: Mach Learn
  doi: 10.1023/A:1010933404324
– ident: 9512_CR32
– start-page: 103
  volume-title: Potential of open source systems as project repositories for empirical studies working group results
  year: 2007
  ident: 9512_CR33
  doi: 10.1007/978-3-540-71301-2_29
– start-page: 1
  volume-title: Collaborative software engineering: concepts and techniques
  year: 2010
  ident: 9512_CR44
  doi: 10.1007/978-3-642-10294-3_1
– ident: 9512_CR26
  doi: 10.1145/2597073.2597074
– ident: 9512_CR17
– ident: 9512_CR19
– ident: 9512_CR40
– ident: 9512_CR13
– ident: 9512_CR41
  doi: 10.1109/APNOMS.2014.6996525
– ident: 9512_CR9
– ident: 9512_CR11
– volume: 15
  start-page: 225
  issue: 3
  year: 1976
  ident: 9512_CR2
  publication-title: IBM Syst J
  doi: 10.1147/sj.153.0225
– volume: 20
  start-page: 38
  issue: 2
  year: 2003
  ident: 9512_CR37
  publication-title: J Eng Technol
– ident: 9512_CR42
  doi: 10.1109/ICSME.2014.62
– ident: 9512_CR5
  doi: 10.1109/ICECCS.2013.42
– ident: 9512_CR21
– ident: 9512_CR6
  doi: 10.1109/COMPSAC.2013.55
– ident: 9512_CR23
– ident: 9512_CR45
  doi: 10.1109/ICST.2008.47
– start-page: 80
  volume-title: Github projects. Quality analysis of open-source software
  year: 2014
  ident: 9512_CR25
  doi: 10.1007/978-3-319-13734-6_6
– ident: 9512_CR3
  doi: 10.1145/2025113.2025119
– ident: 9512_CR46
– ident: 9512_CR4
  doi: 10.1109/ISSRE.2013.6698918
– ident: 9512_CR36
  doi: 10.1109/METRIC.1997.637174
– volume: 27
  start-page: 630
  issue: 7
  year: 2001
  ident: 9512_CR15
  publication-title: IEEE Trans Softw Eng
  doi: 10.1109/32.935855
– ident: 9512_CR10
– ident: 9512_CR31
– ident: 9512_CR34
  doi: 10.1109/ISSRE.2005.35
– ident: 9512_CR12
  doi: 10.1145/1085313.1085331
– ident: 9512_CR16
– ident: 9512_CR18
SSID ssj0009745
Score 2.6014695
Snippet Software forges like GitHub host millions of repositories. Software engineering researchers have been able to take advantage of such a large corpora of...
SourceID proquest
crossref
springer
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 3219
SubjectTerms Classification
Classifiers
Compilers
Computer Science
Ground truth
Interpreters
Noise
Programming Languages
Recall
Repositories
Researchers
Software engineering
Software Engineering/Programming and Operating Systems
Title Curating GitHub for engineered software projects
URI https://link.springer.com/article/10.1007/s10664-017-9512-6
https://www.proquest.com/docview/2063574530
Volume 22
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVLSH
  databaseName: SpringerLink Journals
  customDbUrl:
  mediaType: online
  eissn: 1573-7616
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0009745
  issn: 1382-3256
  databaseCode: AFBBN
  dateStart: 19970301
  isFulltext: true
  providerName: Library Specific Holdings
– providerCode: PRVAVX
  databaseName: SpringerLINK - Czech Republic Consortium
  customDbUrl:
  eissn: 1573-7616
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0009745
  issn: 1382-3256
  databaseCode: AGYKE
  dateStart: 19970101
  isFulltext: true
  titleUrlDefault: http://link.springer.com
  providerName: Springer Nature
– providerCode: PRVAVX
  databaseName: SpringerLink Journals (ICM)
  customDbUrl:
  eissn: 1573-7616
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0009745
  issn: 1382-3256
  databaseCode: U2A
  dateStart: 19970101
  isFulltext: true
  titleUrlDefault: http://www.springerlink.com/journals/
  providerName: Springer Nature
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LT8MwDLZgu3DhjRiMKQdOoEhZ82qP07QxgeDEpHGqkjZBSKigtRN_n6RLKSBA4tzUh8-J7cT2Z4BzajLJeCQxUUpjphOJlfUNzIoormxsdeS7kW_vxGzOrhd8Efq4y6bavUlJ1pb6U7ObEL5iwqcchxEWm9Dlns3LbeJ5NGqZdmU9mdhz62HqHHqTyvxJxFdn1EaY35Kita-Z7sJ2CBLRaK3VPdgwxT7sNAMYUDiPB0DGK6_A4hFdPVWzlUYuAkUmMAyaHJXOxr6ppUHhvaU8hPl0cj-e4TACAWcujqlwYoxzJ84eZUYprmOb5CLOh0TnQipjRcSMdZccTWMrpTHSJLnkSjGrqYgzYukRdIqXwhwDSjjPEsucBE_y51nmqdAx1Yy7S0LO8h6QBos0C_zgfkzFc9oyG3v4Ugdf6uFLRQ8uPn55XZNj_LW43wCchnNSphHxfHiMU9KDywb09vOvwk7-tfoUtiKv9LoKpQ-darkyZy6WqPQAuqOrh5vJoN5D7_MfwHM
linkProvider Springer Nature
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LTwMhECZaD3rxbaxW5eBJQ0KX1-6xaayrtj21SW8EdsGYmNV0t_HvC1vWqlETz8AcPmBmYGa-AeCSmExQFgmEldKI6kQgZX0Bs8KKKRtbHflq5NGYp1N6P2OzUMddNtnuTUiy1tSfit049xkTPuTYjRBfBxuev8oT5k-j3oppV9SdiT23HiLOoDehzJ9EfDVGKw_zW1C0tjWDXbAdnETYW-7qHlgzxT7YaRowwHAfDwDuL_wGFo_w9qlKFxo6DxSawDBoclg6Hfum5gaG_5byEEwHN5N-ikILBJQ5P6ZCiTHOnDh9lBmlmI5tkvM472Kdc6GM5RE11j1yNImtEMYIk-SCKUWtJjzOsCVHoFW8FOYYwISxLLHUSfAkf55lnnAdE02ZeyTkNG8D3GAhs8AP7ttUPMsVs7GHTzr4pIdP8ja4-ljyuiTH-GtypwFYhntSygh7PjzKCG6D6wb01fCvwk7-NfsCbKaT0VAO78YPp2Ar8gegzkjpgFY1X5gz51dU-rw-R-8NwcHL
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEA5aQbz4FqtVc_CkhKab1-6xVGt9FQ8WegvJJhFB1tLd4t832e5aFRU8J5nDN0lmkpn5BoBTYlNBWSQQVkojqhOBlAsFzAorplzsdBSqke-HfDCiN2M2rvqc5nW2ex2SnNc0BJamrGhPjGt_KnzjPGRPhPBjJ0J8GazQwJPgN_Qo6i5Yd0XZpTjw7CHijXsd1vxJxFfDtPA2vwVIS7vT3wTrlcMIu3MNb4Elm22DjboZA6zO5g7AvVlQZvYEr56LwUxD741CW7ENWgNzf9--qamF1d9LvgtG_cvH3gBV7RBQ6n2aAiXWetPi76bUKsV07BLDY9PB2nChrOMRtc4_eDSJnRDWCpsYwZSiThMep9iRPdDIXjO7D2DCWJo46iUEwr_AOE-4jommzD8YDDVNgGssZFpxhYeWFS9ywXIc4JMePhngk7wJzj6WTOZEGX9NbtUAy-rM5DLCgRuPMoKb4LwGfTH8q7CDf80-AasPF315dz28PQRrUdB_mZzSAo1iOrNH3sUo9HG5jd4BpCnGBw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Curating+GitHub+for+engineered+software+projects&rft.jtitle=Empirical+software+engineering+%3A+an+international+journal&rft.au=Munaiah%2C+Nuthan&rft.au=Kroh%2C+Steven&rft.au=Cabrey%2C+Craig&rft.au=Nagappan%2C+Meiyappan&rft.date=2017-12-01&rft.issn=1382-3256&rft.eissn=1573-7616&rft.volume=22&rft.issue=6&rft.spage=3219&rft.epage=3253&rft_id=info:doi/10.1007%2Fs10664-017-9512-6&rft.externalDBID=n%2Fa&rft.externalDocID=10_1007_s10664_017_9512_6
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1382-3256&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1382-3256&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1382-3256&client=summon