character encoding - What is the charset of URLs? -
when types url in browser access page, charset used url? there standard? can consider utf-8 used everywhere? characters accepted?
urls may contain subset of ascii, urls valid ascii.
international domain names must punycode encoded. non-ascii characters in path or query parts must encoded, percent-encoding beingness agreed-upon standard.
percent-encoding takes raw bytes , encodes each byte %xx
. there's no followed standard on encoding should used determine byte representation. such, it's impossible assume particular character set beingness used in percent-encoded representation. if you're creating links, you're in total command on used charset before percent-encoding; if you're not, you're out of luck. though encounter utf-8, not guaranteed.
url character-encoding
No comments:
Post a Comment