Abstract: Rapid advances in technology have produced new techniques for gathering information, one of which is web scraping: the automated extraction of information from websites. A web scraper supplies the data needed to observe behavior and perceptions, which in turn supports accurate market segmentation. Most data collection is still done manually, so the collection process is lengthy; this slows market segment analysis and risks producing inaccurate segmentation. To address this problem, a web scraper for news sites is needed. In this study, a news-site web scraper was built in the Python programming language, with the Flask library providing the scraper's web interface. The Selenium library is used to simplify application development, facilitate interaction with the web, and control a web browser. The program retrieves data by keyword, returns the title, posting date, and summary of each result, and automatically saves the collected data to a CSV file.

Keywords: Internet, News, Python, Scraping, Website
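
The final step described above (selecting results by keyword and saving the title, posting date, and summary to a CSV file) could look roughly like the following. This is a minimal sketch, not the authors' implementation: it uses only the Python standard library, leaves out the Selenium and Flask parts entirely, and assumes the articles have already been scraped into dictionaries. The function names, field names, and sample records are all hypothetical.

```python
import csv
import io

# Hypothetical column names; the abstract only says that the title,
# posting date, and summary are stored.
FIELDS = ["title", "posting_date", "summary"]

# Placeholder records standing in for already-scraped articles.
SAMPLE_ARTICLES = [
    {"title": "Example headline about market trends",
     "posting_date": "2024-01-01",
     "summary": "Placeholder summary text."},
    {"title": "Example headline about sports",
     "posting_date": "2024-01-02",
     "summary": "Another placeholder summary."},
]


def filter_by_keyword(articles, keyword):
    """Keep articles whose title or summary contains the keyword (case-insensitive)."""
    needle = keyword.casefold()
    return [
        article for article in articles
        if needle in article["title"].casefold()
        or needle in article["summary"].casefold()
    ]


def write_csv(articles, stream):
    """Write a header row plus one row per article to an open text stream."""
    writer = csv.DictWriter(stream, fieldnames=FIELDS, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(articles)


if __name__ == "__main__":
    # Write to an in-memory buffer so the sketch has no side effects.
    buffer = io.StringIO()
    write_csv(filter_by_keyword(SAMPLE_ARTICLES, "market"), buffer)
    print(buffer.getvalue())
```

To produce an actual file, `write_csv` can be given a handle opened with `open(path, "w", newline="", encoding="utf-8")`; the `newline=""` argument is what the `csv` module documentation recommends for file output.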
