BHU Digital Repository

Application of Lasso Regression to Model National Development Indicators and National Internet Usage

Show simple item record

dc.contributor.author Musa, Yusuf
dc.contributor.author Babalola, Rotimi
dc.contributor.author Peter, Ogedebe
dc.date.accessioned 2024-05-16T08:06:23Z
dc.date.available 2024-05-16T08:06:23Z
dc.date.issued 2019-01
dc.identifier.citation Musa, Y., Rotimi, B., & Peter, O. (2019). Application of Lasso Regression to Model National Development Indicators and National Internet Usage. Journal of Scientific Research and Reports, 21(5), 1–10. https://doi.org/10.9734/JSRR/2018/31117 en_US
dc.identifier.uri http://localhost:8080/xmlui/handle/123456789/1109
dc.description.abstract Aim: This paper aim to use Lasso Regression Model to ascertain how the level of development in a country affects the interest of a number of internet users. Methodology: Least Absolute Shrinkage and Selection Operator (Lasso) regression with the Least Angle Regression selection (LARs) algorithm with k=5-fold cross validation was used to estimate the lasso regression model used to ascertain the significant association between the number of internet user in a country and the development indicators for that country. The change in the cross validation average (mean) squared error at each step was used to identify the best subset of the predictor variables. The lasso regression model was estimated on a training data set consisting of observations from the year 2012 (N=199), and a test data set included the observations from the year 2013 (N=196). Results: LASSO regression model was trained on N=199 countries and used to identify the best subset of predictors which predicted the response variables; Number of internet users in N=196 countries around the world for the year 2013. The Number of internet users for training and test sets per 100 people for the countries ranged from 1.06 to 96.2 and 1.30 to 96.55 respectively. This indicates that there is significant variation in the response variable. Conclusion: It is possible that the few variable indicators we considered as strong predictors of internet are confounded by other factors not considered in the analysis. Therefore, it is recommended that future efforts should focus on other ways to fill in the missing observations since there are large number of national development indicators/factors that are associated with the number of internet users. en_US
dc.description.sponsorship Self en_US
dc.language.iso en en_US
dc.publisher Journal of Scientific Research and Reports en_US
dc.relation.ispartofseries Vol 23;Issue 5
dc.title Application of Lasso Regression to Model National Development Indicators and National Internet Usage en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search BHUDR


Advanced Search

Browse

My Account