web scraping

I wonder if anyone knows how to scrape data from the table in this URL:

http://www.espn.com/nhl/standings/_/group/league

Both Python and Javascript (and others) have a lot of existing third-party frameworks to manipulate the DOM, so perhaps use one of those nodes to write your custom JS or Python utilizing the DOM libraries.

Thanks.

Hi @Learn2019,

You might also have a look at the Palladian Nodes. This KNIME plugin provides some useful nodes for retrieving data from websites and parsing HTML.

If you need to crawl websites, whose DOM tree is modified by JavaScript, the Selenium Nodes might be worth looking at. They are commercial, but a 30 day trial is available on the creator’s website: https://seleniumnodes.com

Best regards,
Daniel

1 Like