Skip to main content

JAVA: How can I download an HTML file from a site that requires cookies enabled?



I'm trying to download an HTML file from a site. I'm using the following simple method:







URL url = new URL("here goes the link to the html file");

BufferedReader br = new BufferedReader(new InputStreamReader(url.openStream()));

String htmlfile = "";

String temp;

while ((temp = br.readLine()) != null) {

htmlfile+= temp;

}







The problem is that I get the following String in the htmlfile variable:







The installation of ... requires the acceptance of a cookie by your browser

software. The cookie is used to ensure that you and only you are

able to access information ....







In other words, I need to somewhat enable cookies when opening a stream from the url. Is it possible to achieve this by using URL or do I need a different method? Thanks in advance


Comments

  1. If you use a good library like Apache HttpComponents

    http://hc.apache.org/index.html

    it takes care of cookie-management for you.

    ReplyDelete
  2. You can use addRequestProperty() to set a cookie on a URLConnection object, e.g.

    URL url = new URL("here goes the link to the html file");
    URLConnection connection = url.openConnection();
    connection.addRequestProperty("Cookie", "here goes the cookie");
    BufferedReader br = new BufferedReader(new InputStreamReader(connection.getInputStream()));

    ReplyDelete

Post a Comment

Popular posts from this blog

[韓日関係] 首相含む大幅な内閣改造の可能性…早ければ来月10日ごろ=韓国

div not scrolling properly with slimScroll plugin

I am using the slimScroll plugin for jQuery by Piotr Rochala Which is a great plugin for nice scrollbars on most browsers but I am stuck because I am using it for a chat box and whenever the user appends new text to the boxit does scroll using the .scrollTop() method however the plugin's scrollbar doesnt scroll with it and when the user wants to look though the chat history it will start scrolling from near the top. I have made a quick demo of my situation http://jsfiddle.net/DY9CT/2/ Does anyone know how to solve this problem?

Why does this javascript based printing cause Safari to refresh the page?

The page I am working on has a javascript function executed to print parts of the page. For some reason, printing in Safari, causes the window to somehow update. I say somehow, because it does not really refresh as in reload the page, but rather it starts the "rendering" of the page from start, i.e. scroll to top, flash animations start from 0, and so forth. The effect is reproduced by this fiddle: http://jsfiddle.net/fYmnB/ Clicking the print button and finishing or cancelling a print in Safari causes the screen to "go white" for a sec, which in my real website manifests itself as something "like" a reload. While running print button with, let's say, Firefox, just opens and closes the print dialogue without affecting the fiddle page in any way. Is there something with my way of calling the browsers print method that causes this, or how can it be explained - and preferably, avoided? P.S.: On my real site the same occurs with Chrome. In the ex