There is a new KNIME forum. You can still browse and read content from our old forum but if you want to create new posts or join ongoing discussions, please visit our new KNIME forum: https://forum.knime.com

Retrieving an http that spans over multiple pages

Member for

6 years 5 months alfroc

Hi, 
maybe this is a bit OT since it does not concern Knime strictly but I really need hints, if any.

I'm trying to retrieve an url about the reviews of a book that span over multiple pages. In the attached wf I can read only the first 20 reviews instead of the whole 103.

Any suggestions are welcome.
Thanks!
Alfredo

Comments
Mon, 10/09/2017 - 10:55

Member for

1 year 1 month

Marten Pfannenschmidt

Hi Alfredo,

the website you are requesting only provides the 20 most frequent read reviews. The others are provided by JavaScript, which makes it a little tricky to fetch them. There is an existing topic about a similar question: https://www.knime.com/forum/knime-textprocessing/how-to-analyse-a-website

Hope that helps.

Cheers,
Marten

Wed, 10/11/2017 - 09:29

Member for

6 years 5 months

alfroc

Thank you, Marten.