c# - How can i download an image from html documents with wildcards -
i'm writing c# program pull .jpg image html document, name of target image changes every often. me being new programmer, can not figure out how achieve desired result.
i using webclient download html.
so guess have few questions ask here.
- how can use wildcard assume name , length of image name?
- and how can trim html containers away target image in document?
in short, using approach you've described: can't. http requires each individual requested resource accessed name, cannot ask http server return set of resources names match pattern (be wildcard expression or regex).
if, however, know names exist between particular range , follow pattern create series of requests , handle 404 errors accordingly, so:
string resource = "/images/aestheticallyattractivehumanfemaleswithoutclothing/img_{0}.jpg"; for(int i=1;i<100;i++) { string thisresource = string.format(cultureinfo.invariantculture, resource, i); httpwebrequest request = new (httpwebrequest)webrequest.create(thisresource); httpwebresponse response = request.getresponse(); if( response.status == httpstatus.ok ) { using(stream rs = response.getresponsestream()) using(filestream fs = new filestream(path.combine("c:\\temp\\irstaxreturns2011\\" + i.tostring() + ".jpg") { rs.copyto( fs ); } } }
Comments
Post a Comment