Hi guys, i am working on some project and i need to extract comments from a forum,blog web page.
The problem is the different implementation on different pages, some have div element with id="comment" or class="comment" or div inside div or some dont have any id/class with comments -- so many possibilities are there.
I need a general solution how to get all comments from any web page.
I am doing it in java and using Jsoup lib for it which is working well to extract contents, but how to identify the comment blocks with so many different possibilities. If it is not possible then there should be a standard way of writing code for comment block that will have id or class with value as comment.
Any suggestions.Thanx.
The problem is the different implementation on different pages, some have div element with id="comment" or class="comment" or div inside div or some dont have any id/class with comments -- so many possibilities are there.
I need a general solution how to get all comments from any web page.
I am doing it in java and using Jsoup lib for it which is working well to extract contents, but how to identify the comment blocks with so many different possibilities. If it is not possible then there should be a standard way of writing code for comment block that will have id or class with value as comment.
Any suggestions.Thanx.