Please use this identifier to cite or link to this item:
http://localhost:8080/xmlui/handle/123456789/1109
Title: | Application of Lasso Regression to Model National Development Indicators and National Internet Usage |
Authors: | Musa, Yusuf Babalola, Rotimi Peter, Ogedebe |
Issue Date: | Jan-2019 |
Publisher: | Journal of Scientific Research and Reports |
Citation: | Musa, Y., Rotimi, B., & Peter, O. (2019). Application of Lasso Regression to Model National Development Indicators and National Internet Usage. Journal of Scientific Research and Reports, 21(5), 1–10. https://doi.org/10.9734/JSRR/2018/31117 |
Series/Report no.: | Vol 23;Issue 5 |
Abstract: | Aim: This paper aim to use Lasso Regression Model to ascertain how the level of development in a country affects the interest of a number of internet users. Methodology: Least Absolute Shrinkage and Selection Operator (Lasso) regression with the Least Angle Regression selection (LARs) algorithm with k=5-fold cross validation was used to estimate the lasso regression model used to ascertain the significant association between the number of internet user in a country and the development indicators for that country. The change in the cross validation average (mean) squared error at each step was used to identify the best subset of the predictor variables. The lasso regression model was estimated on a training data set consisting of observations from the year 2012 (N=199), and a test data set included the observations from the year 2013 (N=196). Results: LASSO regression model was trained on N=199 countries and used to identify the best subset of predictors which predicted the response variables; Number of internet users in N=196 countries around the world for the year 2013. The Number of internet users for training and test sets per 100 people for the countries ranged from 1.06 to 96.2 and 1.30 to 96.55 respectively. This indicates that there is significant variation in the response variable. Conclusion: It is possible that the few variable indicators we considered as strong predictors of internet are confounded by other factors not considered in the analysis. Therefore, it is recommended that future efforts should focus on other ways to fill in the missing observations since there are large number of national development indicators/factors that are associated with the number of internet users. |
URI: | http://localhost:8080/xmlui/handle/123456789/1109 |
Appears in Collections: | Research Articles |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Application of Lasso Regression to Model National Development Indicators and National Internet Usage.pdf | Main Article | 308.94 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.