I'm working on a screen-scraping engine that uses the Windows Forms WebBrowser control to simulate browsing a website: log in, search, extract the search results via regex, and log out after the search. If I use Internet Explorer, the search results work fine. All special characters are displayed properly in the browser. I tried putting the browser control on a host Windows Form and that works fine too. But in my code, where the web browser is not attached to a UI handle, the browser.DocumentText content is not decoded properly. I checked the Document.Encoding property and compared it with the one in Internet Explorer, and it is the same.

I tried setting the encoding to UTF-8 at runtime and it still does not work. Special Danish characters are displayed as boxes, which means they cannot be interpreted. I tried taking the DocumentText string, converting it into a Unicode byte array, and converting that to ASCII, and it still does not work.

My OS is Windows Vista Ultimate, with language packs installed.

Anything else you can suggest? TIA.
Hi, I had the same problem in an asynchronous transfer with Swedish characters: when I used UTF8 on both sides, they showed up as small boxes. But with Encoding.Default it worked as it should. UTF8 used 2 bytes for each of those characters instead of the single unsigned byte I thought it would.
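
To illustrate the byte-count point, here is a small sketch in Python (not the .NET code from this thread). It assumes the page is actually served in a single-byte Western European code page such as Windows-1252, which is what Encoding.Default typically maps to on a Danish or Swedish Windows install; the sample text and the helper name are made up for the demonstration.

```python
# Sketch only: shows why decoding single-byte text as UTF-8 (or forcing it
# through ASCII) loses Danish/Swedish letters. The sample string is invented.
text = "æøå ÆØÅ"

# The same letter takes a different number of bytes in each encoding.
utf8_len = len("å".encode("utf-8"))      # two bytes in UTF-8
cp1252_len = len("å".encode("cp1252"))   # one byte in Windows-1252

# Assumption: the server sent the page as Windows-1252 bytes.
raw = text.encode("cp1252")


def decode_with(raw_bytes: bytes, encoding: str) -> str:
    """Decode bytes, substituting U+FFFD for anything that cannot be decoded."""
    return raw_bytes.decode(encoding, errors="replace")


wrong = decode_with(raw, "utf-8")    # mismatched decoder: replacement chars
right = decode_with(raw, "cp1252")   # matching decoder: original text

# Converting to ASCII cannot help, because ASCII has no æ, ø or å at all.
ascii_lossy = text.encode("ascii", errors="replace").decode("ascii")

print(utf8_len, cp1252_len)
print(repr(wrong))
print(right)
print(ascii_lossy)
```

If the same mismatch is what happens in the scraper, the fix would be to decode the raw bytes with the encoding the server really used, rather than converting the already-decoded string afterwards.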