Web Application for Visualizing Web Scraping Using the VSM Method

Andi Nurkholis(1*), Yusra Fernando(2), Faris Arkan Ans(3)

(1) Universitas Teknokrat Indonesia, Indonesia
(2) Universitas Teknokrat Indonesia, Indonesia
(3) Universitas Teknokrat Indonesia, Indonesia
(*) Corresponding Author

Abstract


In the modern digital era, the internet is central to nearly every aspect of community life, including work. Many platforms already list job vacancies, especially for freelancers. To find relevant openings, job seekers typically have to visit several websites to gather this information. Web scraping is one way to overcome this problem. This research uses web scraping to obtain data from the freelancing websites Sribulancer, Project, and Freelancer, collecting the data with the BeautifulSoup and Selenium libraries in line with previous research findings. To search the data, the vector space model (VSM) approach is used to determine the degree of similarity between queries and documents. Testing yielded a mean accuracy of 56% and a perfect mean recall of 100%. The accuracy is low because searches use three parameters, so a document that contains words from the user's query is likely to be returned even when its context does not match. With the Streamlit framework in Python, which displays the results of data processing, users can manage the web scraping, data processing, and data search processes. Users can search data from multiple websites using the vector space model rather than visiting each freelance website one by one. Presenting the web scraping results through data visualization, in a web application built with the Streamlit framework, also makes them more user-friendly and saves time.
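The abstract does not specify the term weighting or the implementation behind the vector space model search, so the following is only a minimal sketch of the general technique, assuming smoothed TF-IDF weights and cosine similarity. The sample job titles in JOBS and all function names are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter

# Hypothetical job titles standing in for scraped listings (not from the paper).
JOBS = [
    "python web scraping developer",
    "logo design for coffee shop",
    "web design and logo animation",
]


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def build_idf(docs_tokens):
    """Smoothed inverse document frequency for every term in the corpus."""
    n_docs = len(docs_tokens)
    df = Counter()
    for tokens in docs_tokens:
        df.update(set(tokens))  # count each term once per document
    return {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}


def tfidf_vector(tokens, idf):
    """Sparse TF-IDF vector as a dict; terms unseen in the corpus are dropped."""
    tf = Counter(tokens)
    return {t: freq * idf[t] for t, freq in tf.items() if t in idf}


def cosine(u, v):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def search(query, documents):
    """Return (document index, score) pairs with non-zero score, best first."""
    docs_tokens = [tokenize(d) for d in documents]
    idf = build_idf(docs_tokens)
    query_vec = tfidf_vector(tokenize(query), idf)
    scored = [
        (i, cosine(query_vec, tfidf_vector(tokens, idf)))
        for i, tokens in enumerate(docs_tokens)
    ]
    hits = [(i, s) for i, s in scored if s > 0.0]
    return sorted(hits, key=lambda pair: (-pair[1], pair[0]))


if __name__ == "__main__":
    for index, score in search("web scraping", JOBS):
        print(f"{score:.3f}  {JOBS[index]}")
```

In this sketch every document that shares at least one term with the query receives a non-zero score, so the query "web scraping" also returns the web design listing. This is consistent with the abstract's explanation that documents containing query words can be returned even when their context does not match.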

DOI: http://dx.doi.org/10.30645/j-sakti.v7i2.663

J-SAKTI (Jurnal Sains Komputer & Informatika)