• redw04@lemmy.ca
    link
    fedilink
    arrow-up
    3
    arrow-down
    7
    ·
    1 year ago

    So uh…as someone who’s currently trying to scrape the web for email addresses to add to my potential client list … where do I start researching this?

    • lutillian@sh.itjust.works
      link
      fedilink
      arrow-up
      4
      ·
      1 year ago

      Start looking into selenium, probably in Python. It’s one of the easier to understand forms of scraping. It’s mainly used to web testing, though you can definitely use it for less… nice purposes.

    • PieMePlenty@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      edit-2
      1 year ago

      Step one will be learning to code in any language. Step two is using a library to help with it. HtmlAgilityPack has always been there for me. Don’t use regex.