java - Fail to get full content of page with JSoup -
i trying fetch content following page jsoup:
http://www.etronics.com/appliances/cooking.html#!/limit=all
i'm requesting page jsoup follow:
jsoup.connect(url).useragent(user_agent).timeout(timeoutinms).data("limit","all").get().outerhtml(); where
user_agent = "mozilla/5.0 (x11; linux x86_64) applewebkit/537.36 (khtml, gecko) chrome/33.0.1750.146 safari/537.36"; i expect page containing 990 products 384. have page content in browser.
as explained here may caused jsoup not executing javascript i'm nor sure cause of problem or @ least don't know how check that.
how can obtain every elements visible via page view source ?
try loading webpage inside htmlunit, execute javascript (you can read how in docs) - allows access webpage dom.
you ould fire developer tools in browser when viewing page , see how many http requests makes , - if loading products in separate requests there's scripting involved.
Comments
Post a Comment