Sunday, 10 December 2017

Mengekstrak Data dari apbd.jakarta.go.id

Status : Draft 

Mendapat email dari Mokhtar Ebrahim, beliau mengatakan, jika konten web, menggunakan javascript dan/atau meload content melalui ajax call,  BeautifulSoup dan Scrappy tidak akan dapat melakukannya, pilihan yang tepat untuk tugas ini adalah Selenium [5]

Referensi

  1. APBD Elektronik Pemerintah Provinsi DKI Jakarta, http://apbd.jakarta.go.id/
  2. How to scrape websites with Python and BeautifulSoup, https://medium.freecodecamp.org/how-to-scrape-websites-with-python-and-beautifulsoup-5946935d93fe
  3. Scrappy, An open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way, https://scrapy.org
  4. Mewariskan Semangat Bung Hatta kepada Para Pemuda Karang Taruna di Rukun Tetangga, http://pemerintahan.openthinklabs.com/2017/12/mewariskan-semangat-bung-hatta-kepada-para-pemuda-karang-taruna-di-rukun-tetangga.html
  5. Selenium, https://www.seleniumhq.org
  6. 20+ Python Web Scraping Examples (Beautifulsoup & Selenium), https://likegeeks.com/python-web-scraping/

2 comments:

  1. I know a better guide for web scraping if you want.
    Using BeautifulSoup can't scrape Ajax pages as an example.
    I can't find a mail for me for contact.
    If you are interested, just contact me.

    Regards,

    ReplyDelete
    Replies
    1. Thank you, added, references number 5 and 6 ..

      Delete