Developing an Indonesia's health literacy short-form survey questionnaire (HLS-EU-SQ10-IDN) using the feature selection and genetic algorithm
| Published in | Computer methods and programs in biomedicine, Vol. 182, p. 105047 |
|---|---|
| Format | Journal Article | 
| Language | English | 
| Published | Ireland: Elsevier B.V, 01.12.2019 |
| ISSN | 0169-2607, 1872-7565 |
| DOI | 10.1016/j.cmpb.2019.105047 | 
| Summary: | • A short version of the HLS-EU-Q47 was needed for health literacy research in Indonesia because an earlier study using the HLS-EU-Q47 ran into difficulties with time-consuming interviews.
• This is the first study to apply a data mining technique to form a short questionnaire in health literacy research.
• A data mining technique using feature selection can be applied to develop a short version of the questionnaire and was shown to have better accuracy than short versions generated by a traditional statistical technique.
• Combining a genetic algorithm with the k-NN algorithm makes feature selection more precise in predicting the label even though fewer features are included in the construct of the questionnaire.
• The resulting HLS-EU-SQ10-IDN, as a measurement tool for health literacy in Indonesia, should be suited to Indonesia's circumstances. It focuses on features about finding health information, because health information seeking remains challenging in developing countries such as Indonesia, even for health care professionals.
Measuring health literacy has become more important because of its association with health status and healthcare outcomes. Studies have developed at least 133 measurement tools for health literacy. The HLS-EU-Q47 is a questionnaire consisting of 12 sub-dimensions and 47 questions developed by the European Health Literacy Consortium. Many countries in Europe and Asia have used the HLS-EU-Q47 to measure health literacy in the general public. Indonesia has conducted a general health literacy survey using the HLS-EU-Q47 but encountered difficulties because of the time-consuming interviews. A shorter version of the HLS-EU-Q47 is therefore needed for health literacy research in Indonesia. This paper reports the results of feature reduction to develop a short Indonesian version of the HLS-EU questionnaire and compares the accuracy of the model with other short forms such as the HLS-EU-SQ16 and HLS-SF12.
The analysis was performed on a population-based dataset from the Indonesia-Semarang Health Literacy Survey, which included specific target variables classifying health literacy level. All attributes were assessed as potential targets in the models derived from the full dataset and its subsets. Feature selection with a genetic algorithm was used as the filter, together with cross-validation for validation and k-nearest neighbor (k-NN) for classification. The predictive accuracy for health literacy level and the complexity of the models based on the reduced datasets were compared across the methods and against other short versions such as the HLS-EU-SQ16 and HLS-SF12.
The accuracies of the existing short-form models were 90.64% for the HLS-EU-SQ16 and 88.67% for the HLS-SF12. This study proposes a model with 10 features as the construct of a short Indonesian version (the HLS-EU-SQ10-IDN), since the model achieved higher accuracy than the HLS-SF12 with fewer features for measuring the general health literacy index. However, the short version covers only part of the 12 dimensions of the full questionnaire.
A data mining technique using feature selection with a combination of a genetic algorithm and the k-NN algorithm was applied to develop a short-version questionnaire and showed better accuracy than the short version developed by a traditional statistical technique. | 
|---|---|
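
The approach described in the summary (a genetic algorithm that searches for a small subset of questionnaire items, with each candidate subset scored by cross-validated k-NN accuracy) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not state the GA operators, the value of k, the number of folds, or the fitness function, so the tournament selection, one-point crossover, bit-flip mutation, `k=3`, 5 folds, and the per-item `penalty` below are all assumptions. The data are synthetic 4-point responses to 12 hypothetical items, not survey data.

```python
import random

def knn_predict(train, x, features, k=3):
    """Majority vote among the k nearest training rows, using only `features`."""
    nearest = sorted(
        train,
        key=lambda pair: sum((pair[0][j] - x[j]) ** 2 for j in features),
    )[:k]
    votes = [label for _, label in nearest]
    return max(set(votes), key=votes.count)

def cv_accuracy(X, y, mask, k=3, folds=5):
    """k-fold cross-validated k-NN accuracy for the items switched on in `mask`."""
    features = [j for j, bit in enumerate(mask) if bit]
    if not features:
        return 0.0
    correct = 0
    for f in range(folds):
        train = [(X[i], y[i]) for i in range(len(X)) if i % folds != f]
        for i in range(f, len(X), folds):  # held-out rows of fold f
            correct += knn_predict(train, X[i], features, k) == y[i]
    return correct / len(X)

def fitness(X, y, mask, penalty=0.01):
    """Accuracy minus a small cost per retained item (the penalty is an assumption)."""
    return cv_accuracy(X, y, mask) - penalty * sum(mask)

def select_items(X, y, pop_size=12, generations=10, p_mut=0.1, seed=0):
    """Genetic search over 0/1 item masks; returns the best mask and fitness history."""
    rng = random.Random(seed)
    n = len(X[0])
    cache = {}

    def score(mask):
        key = tuple(mask)
        if key not in cache:  # avoid re-running cross-validation for a seen mask
            cache[key] = fitness(X, y, mask)
        return cache[key]

    pop = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        pop.sort(key=score, reverse=True)
        history.append(score(pop[0]))
        children = [pop[0][:]]                      # elitism: keep the best mask
        while len(children) < pop_size:
            a = max(rng.sample(pop, 3), key=score)  # tournament selection
            b = max(rng.sample(pop, 3), key=score)
            cut = rng.randrange(1, n)               # one-point crossover
            child = a[:cut] + b[cut:]
            child = [bit ^ (rng.random() < p_mut) for bit in child]  # bit-flip mutation
            children.append(child)
        pop = children
    best = max(pop, key=score)
    history.append(score(best))
    return best, history

# Synthetic example: 40 respondents, 12 items scored 1-4; only the first three
# items determine the (hypothetical) health literacy label.
data_rng = random.Random(1)
X = [[data_rng.randint(1, 4) for _ in range(12)] for _ in range(40)]
y = [int(row[0] + row[1] + row[2] >= 8) for row in X]

best_mask, history = select_items(X, y)
selected = [j for j, bit in enumerate(best_mask) if bit]
print("selected items:", selected, "CV accuracy:", cv_accuracy(X, y, best_mask))
```

Because the best mask is carried over unchanged each generation, the recorded best fitness never decreases. The per-item penalty is one simple way to favor shorter questionnaires; a fixed target size (for example, exactly 10 items) would need a different encoding or a repair step.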