An Effective Topic Modeling Strategies for Recommender Systems in Crowdfunding Platforms

Authors

  • Suresh Subramanian College of Information Technology, Ahlia University, Kingdom of Bahrain. Author

DOI:

https://doi.org/10.56294/dm2024.349

Keywords:

Latent Dirichlet Allocation, Latent Semantic Analysis, Support Vector Machine, eXtreme Gradient Boosting, Random Forest, Crowdfunding

Abstract

capitalists come up with creative and innovative concepts, but a lack of finance limits their untapped economic potential. There are several channels that new entrepreneurs may use and take advantage of to attract money and other financial resources when beginning a firm thanks to current technology, which has drastically altered the way business is done on a broad scale. An entrepreneur uses the Internet to promote his concept to potential backers through crowdfunding. Online crowdfunding has labored to develop several advanced platforms that may serve as an interface to the fundraising process for a certain concept or project. Typically, the owner of the concept explores the market and does extensive research through a variety of channels, with the Internet assisting in moving ahead and making the idea actual. In truth, the owner of the concept frequently suffers obstacles and financial issues, therefore crowdsourcing helps to alleviate these issues. In this study, machine learning methods were used to train the system on the given data, beginning with the theme, followed by the blurb, which is the topic description, and finally by the topic category. Latent Dirichlet Allocation (LDA) and Latent Semantic Analysis (LSA) were employed as machine learning approaches to accomplish the goal. This study employs a variety of text classification algorithms, including Support Vector Machine (SVM), EXtreme Gradient Boosting (XG), K-Nearest Neighbours (KNN), and Random Forest (RF), to propose and forecast subject categories. Each algorithm performed differently in terms of precision, predictability, positive rate, and model correctness. SVM was the highest performance measuremen

References

1) Jin W. Research on Machine Learning and Its Algorithms and Development. J Phys Conf Ser. 2020;1544(1). doi: 10.1088/1742-6596/1544/1/012003.

2) Kolluri J, Razia S, Nayak SR. Text Classification Using Machine Learning and Deep Learning Models. SSRN Electron J. 2020. doi: 10.2139/ssrn.3618895.

3) Aljebory KM, Jwmah YM, Mohammed TS. Classification of EMG Signals: Using DWT Features and ANN Classifier. IAENG Int J Comput Sci. 2024;51(1).

4) Naqa IE, Murphy MJ. Machine Learning in Radiation Oncology. In: Machine Learning in Radiation Oncology. 2015. p. 3–11. doi: 10.1007/978-3-319-18305-3.

5) Hannigan TR, et al. Topic modelling in management research: Rendering new theory from textual data. Acad Manag Ann. 2019;13(2):586–632. doi: 10.5465/annals.2017.0099.

6) Hasan M, Hossain MM, Ahmed A, Rahman MS. Topic Modelling: A Comparison of the Performance of Latent Dirichlet Allocation and LDA2vec Model on Bangla Newspaper. 2019 Int Conf Bangla Speech Lang Process ICBSLP. 2019 Sep;27–8. doi: 10.1109/ICBSLP47725.2019.202047.

7) Shafqat W, Byun YC. A recommendation mechanism for under-emphasized tourist spots using topic modelling and sentiment analysis. Sustainability. 2020;12(1). doi: 10.3390/SU12010320.

8) Debortoli S, Müller O, Junglas I, vom Brocke J. Text Mining For Information Systems Researchers: An Annotated Topic Modeling Tutorial. Commun Assoc Inf Syst. 2016;39. doi: 10.17705/1CAIS.03907.

9) Iyyappan M, Ahmad S, Jha S, Alam A, Yaseen M, Hikmat A. A Novel AI-Based Stock Market Prediction Using Machine Learning Algorithm. Sci Program. 2022;1-11. doi: 10.1155/2022/4808088.

10) Wei L, et al. A Lightweight Sentiment Analysis Framework for a Micro-Intelligent Terminal. Sensors. 2023;23(2):741. doi: 10.3390/s23020741.

11) Alfred R, Loo YJ, Obit JH, Lim Y, Haviluddin H, Azman A. Social media mining: a genetically based multiobjective clustering approach to topic modelling. 2021.

12) Hsiang FY, Chia HH, Yu CJ, Chih JL. LibShortText: A Library for Short-text Classification and Analysis. 2013. Available at: https://www.csie.ntu.edu.tw/~cjlin/libshorttext/.

13) Wang W, et al. Signaling persuasion in crowdfunding entrepreneurial narratives: The subjectivity vs objectivity debate. Comput Human Behav. 2021;114(Sep 2020):106576. doi: 10.1016/j.chb.2020.106576.

14) Wang W, Wu YJ. Online Financing Campaigns’ Comments: Insights from Crowdfunding Pitches. In: Visvizi A, Lytras MD, Aljohani NR, editors. Research and Innovation Forum 2020 - Disruptive Technologies in Times of Change. Springer; 2021. p. 485-93. doi: 10.1007/978-3-030-62066-0_37.

15) Peng N, et al. Predicting fundraising performance in medical crowdfunding campaigns using machine learning. Electronics (Switzerland). 2021;10(2):1-16. doi: 10.3390/electronics10020143.

16) Yuan H, Lau RYK, Xu W. The determinants of crowdfunding success: A semantic text analytics approach. Decis Support Syst. 2016;91:67-76. doi: 10.1016/j.dss.2016.08.001.

17) Zhao Y, Harris P, Lam W. Crowdfunding industry—History, development, policies, and potential issues. J Public Aff. 2019;19(1). doi: 10.1002/pa.1921.

18) Robinson D. broom: An R Package for Converting Statistical Analysis Objects Into Tidy Data Frames. 2014. Available at: https://arxiv.org/abs/1412.3565.

19) Slavik S. The Business Model of Start-Up — Structure and Consequences. Adm Sci. 2019;9(69):1-23. doi: 10.3390/admsci9030069.

20) McGowan E. What is Crowdfunding? 2018. Available at: https://www.startups.com/library/expert-advice/what-is-crowdfunding.

21) Uthirapathy SE, Sandanam D. Topic Modelling and Opinion Analysis On Climate Change Twitter Data Using LDA And BERT Model. Procedia Comput Sci. 2023;218:908-17.

22) Ogunleye B, Maswera T, Hirsch L, Gaudoin J, Brunsdon T. Comparison of topic modelling approaches in the banking context. Appl Sci. 2023;13(2):797.

23) N R, Nachiappan B, Kalpana C, Mohanraj A, Prabhu Shankar B, Viji C. Machine Learning-Based System for Automated Presentation Generation from CSV Data. Data and Metadata [Internet]. 2024 Jul. 2 [cited 2024 Jul. 8];3:359. Available from: https://dm.saludcyt.ar/index.php/dm/article/view/359.

24) Nachiappan B, Rajkumar N, Viji C, A M. Artificial and Deceitful Faces Detection Using Machine Learning. Salud, Ciencia y Tecnología - Serie de Conferencias [Internet]. 2024 Mar. 11 [cited 2024 Jul. 8];3:611. Available from:

https://conferencias.saludcyt.ar/index.php/sctconf/article/view/611

Downloads

Published

2024-01-01

Issue

Section

Original

How to Cite

1.
Subramanian S. An Effective Topic Modeling Strategies for Recommender Systems in Crowdfunding Platforms. Data and Metadata [Internet]. 2024 Jan. 1 [cited 2024 Oct. 13];3:.349. Available from: https://dm.ageditor.ar/index.php/dm/article/view/349