Skip to main content

JAVA: How can I download an HTML file from a site that requires cookies enabled?



I'm trying to download an HTML file from a site. I'm using the following simple method:







URL url = new URL("here goes the link to the html file");

BufferedReader br = new BufferedReader(new InputStreamReader(url.openStream()));

String htmlfile = "";

String temp;

while ((temp = br.readLine()) != null) {

htmlfile+= temp;

}







The problem is that I get the following String in the htmlfile variable:







The installation of ... requires the acceptance of a cookie by your browser

software. The cookie is used to ensure that you and only you are

able to access information ....







In other words, I need to somewhat enable cookies when opening a stream from the url. Is it possible to achieve this by using URL or do I need a different method? Thanks in advance


Comments

  1. If you use a good library like Apache HttpComponents

    http://hc.apache.org/index.html

    it takes care of cookie-management for you.

    ReplyDelete
  2. You can use addRequestProperty() to set a cookie on a URLConnection object, e.g.

    URL url = new URL("here goes the link to the html file");
    URLConnection connection = url.openConnection();
    connection.addRequestProperty("Cookie", "here goes the cookie");
    BufferedReader br = new BufferedReader(new InputStreamReader(connection.getInputStream()));

    ReplyDelete

Post a Comment

Popular posts from this blog

Slow Android emulator

I have a 2.67 GHz Celeron processor, 1.21 GB of RAM on a x86 Windows XP Professional machine. My understanding is that the Android emulator should start fairly quickly on such a machine, but for me it does not. I have followed all instructions in setting up the IDE, SDKs, JDKs and such and have had some success in staring the emulator quickly but is very particulary. How can I, if possible, fix this problem?

CCNA 1 Final Exam 2011 latest (hot hot hot)

  Hi! I have been posted content of ccna1 final exam (latest and only question.) I will post the answer and insert image on sunday. If you care, please subscribe your email an become a first person have full test content. Subcribe now  Some question  have not content because this question have images content. So that can you wait for me? SUNDAY 1. A user sees the command prompt: Router(config-if)# . What task can be performed at this mode? Reload the device. Perform basic tests. Configure individual interfaces. Configure individual terminal lines. 2. Refer to the exhibit. Host A attempts to establish a TCP/IP session with host C. During this attempt, a frame was captured with the source MAC address 0050.7320.D632 and the destination MAC address 0030.8517.44C4. The packet inside the captured frame has an IP source address 192.168.7.5, and the destination IP address is 192.168.219.24. At which point in the network was this packet captured? leaving host A leaving ATL leaving...