GPT 모델을 활용한 보안취약점 탐색

본 논문은 최근 큰 관심을 받는 대규모 언어처리 모델인 GPT의 소프트웨어 보안취약점 탐색에 있어서 활용 가능성과 한계를 탐색했다. 이를 위해, 다양한 오픈소스 프로젝트에서 독립적인 커밋 기록과 Java 파일을 수집하고, 이를 바탕으로 gpt-3.5-turbo-16k 모델의 보안취약점 탐지 능력을 효과적으로 검증했다. 질의응답 과정을 통해 도출된 답변을 5개의 항목으로 나누고 검증하는 과정을 거쳤다. 이 검증 과정에서 GPT 모델 답변의 신뢰성을 검증하기 위해 상용화된 정적분석기 3종과 비교하는 방식을 채택했다. 본 연구는 GPT의...

Full description

Saved in:

Bibliographic Details
Published in	정보과학회 컴퓨팅의 실제 논문지 Vol. 30; no. 12; pp. 611 - 618
Main Authors	장주영(Jooyoung Jang), 배재호(Jaeho Bae), 김대석(Daeseok Kim), 남재창(Jaechang Nam)
Format	Journal Article
Language	Korean
Published	Korean Institute of Information Scientists and Engineers 01.12.2024 한국정보과학회
Subjects	컴퓨터학 GPT model GPT 모델 GPT 3.5 보안취약점 탐색 보안취약점 vulnerability detection security vulnerability
Online Access	Get full text
ISSN	2383-6318 2383-6326
DOI	10.5626/KTCP.2024.30.12.611

Cover

More Information
Summary:	본 논문은 최근 큰 관심을 받는 대규모 언어처리 모델인 GPT의 소프트웨어 보안취약점 탐색에 있어서 활용 가능성과 한계를 탐색했다. 이를 위해, 다양한 오픈소스 프로젝트에서 독립적인 커밋 기록과 Java 파일을 수집하고, 이를 바탕으로 gpt-3.5-turbo-16k 모델의 보안취약점 탐지 능력을 효과적으로 검증했다. 질의응답 과정을 통해 도출된 답변을 5개의 항목으로 나누고 검증하는 과정을 거쳤다. 이 검증 과정에서 GPT 모델 답변의 신뢰성을 검증하기 위해 상용화된 정적분석기 3종과 비교하는 방식을 채택했다. 본 연구는 GPT의 보안취약점 탐지 도구로서의 가능성과 한계를 관찰했다. 이를 바탕으로 후속 연구에 도움이 될 수 있는 중요한 데이터와 결과를 제공하고, 보안취약점 탐색에 관한 연구 분야에 기여하고자 한다. 선행 연구의 한계를 극복하기 위해 보안취약점 탐색에 다양한 상황을 고려할 수 있는 넓은 범위의 데이터를 활용했고, GPT 모델의 답변 신뢰성을 검증했다. This paper explores the potential and limitations of leveraging the Generative Pre-trained Transformer (GPT) model, a widely recognized large-scale language processing model, for the detection of software security vulnerabilities. The study collected independent commit records and Java files from diverse open-source projects and effectively validated the security vulnerability detection capabilities of the gpt-3.5-turbo-16k model using this dataset. The process included categorizing and validating responses obtained through prompting and dividing them into five categories. In the validation phase, a comparison was made with three commercial static analyzers to gauge the reliability of GPT model responses. This research contributes insights into the possibilities and constraints of GPT as a security vulnerability detection tool, providing valuable data and results for future research in the field. To overcome the limitations of prior studies, a comprehensive dataset encompassing various scenarios for security vulnerability exploration was employed, and the reliability of GPT model responses was thoroughly verified. KCI Citation Count: 0
ISSN:	2383-6318 2383-6326
DOI:	10.5626/KTCP.2024.30.12.611