java - Data Extraction?


I am looking for ways to extract different kinds of data from different websites. I know there are programs you can buy, but for the sake of learning I want to build it myself. Does anyone have suggestions for a general structure, and if so, which language would you write it in? My first thought was Java, but I am open to and thankful for anyone else's opinion.

What kind of data are you trying to extract from websites? Which websites? A little more information about your project would be helpful.

I recently had to try out a few HTML parsers so that I could get some data I wanted into a more consolidated format.

I tried JTidy and looked at Web-Harvest. JTidy did not do much of what I wanted, and Web-Harvest was overkill.

I finally settled on developing a small tool of my own to get the data, using Java + htmlparser. The parser allows you to 'filter' DOM searches for specific things.
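To give a rough idea of what filtering a DOM search looks like, here is a minimal sketch. It is not the htmlparser API: it uses the XML DOM parser that ships with the JDK as a stand-in, so it assumes the page is well-formed (XHTML-like) markup, which real-world HTML often is not. The class name `FilteredDomSearch`, the helpers `findAll`, `tagIs` and `hasAttribute`, and the sample page are all made up for illustration.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical helper class: a stand-in for a filtering HTML parser,
// built only on the JDK's XML DOM classes.
final class FilteredDomSearch {

    // Parses well-formed markup into a DOM tree (assumption: input is valid XML).
    public static Document parse(String markup) {
        try {
            return DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(markup.getBytes(StandardCharsets.UTF_8)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Markup is not well-formed", e);
        }
    }

    // Walks the tree depth-first and collects every element the filter accepts.
    public static List<Element> findAll(Node root, Predicate<Element> filter) {
        List<Element> matches = new ArrayList<>();
        collect(root, filter, matches);
        return matches;
    }

    private static void collect(Node node, Predicate<Element> filter, List<Element> out) {
        if (node instanceof Element) {
            Element element = (Element) node;
            if (filter.test(element)) {
                out.add(element);
            }
        }
        NodeList children = node.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            collect(children.item(i), filter, out);
        }
    }

    // Filter: element has the given tag name (case-insensitive).
    public static Predicate<Element> tagIs(String name) {
        return e -> e.getTagName().equalsIgnoreCase(name);
    }

    // Filter: element carries the attribute with exactly this value.
    public static Predicate<Element> hasAttribute(String attr, String value) {
        return e -> value.equals(e.getAttribute(attr));
    }

    public static void main(String[] args) {
        // Made-up sample page, not taken from any real site.
        String page = "<html><body>"
                + "<a class=\"product\" href=\"/item/1\">Widget</a>"
                + "<a class=\"nav\" href=\"/home\">Home</a>"
                + "<span class=\"product\">Not a link</span>"
                + "<a class=\"product\" href=\"/item/2\">Gadget</a>"
                + "</body></html>";

        // Combine filters: only <a> elements whose class is "product".
        Predicate<Element> productLinks = tagIs("a").and(hasAttribute("class", "product"));
        for (Element link : findAll(parse(page), productLinks)) {
            System.out.println(link.getTextContent() + " -> " + link.getAttribute("href"));
        }
    }
}
```

The point is that the extraction logic is just a tree walk plus composable filters, so a new kind of data from a new site mostly means writing a new filter. For messy real-world pages you would swap the `parse` step for a lenient HTML parser.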

