To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 張←?浴?????絶??厭??厓??絶??^ 1001001010100011100000011010100100111111100101111000000100111111001111110011111100111111001111111001000011100010001111110011111110001001011111010011111100111111111110101000110100111111001111111001000011100010001111110011111101011110 92a381a93f97813f3f3f3f3f90e23f3f897d3f3ffa8d3f3f90e23f3f5e
EUC-JP 張←?浴??獒??絶??厭??厓??絶??^ 1100010010100101101000101010101100111111110011011110000100111111001111111000111111001011101110110011111100111111110000001110010000111111001111111011000111011110001111110011111110001111101101001100011100111111001111111100000011100100001111110011111101011110 c4a5a2ab3fcde13f3f8fcbbb3f3fc0e43f3fb1de3f3f8fb4c73f3fc0e43f3f5e
UTF-8 張←뼻浴뜹뤀獒뷂푷絶욐눀厭얗굡厓김짅絶낉풙^ 11100101101111001011010111100010100001101001000011101011101111001011101111100110101101011011010011101011100111001011100111101011101001001000000011100111100011011001001011101011101101111000001011101101100100011011011111100111101101011011011011101100100110101001000011101011100010001000000011100101100011101010110111101100100101101001011111101010101101011010000111100101100011101001001111101010101110011000000011101100101001111000010111100111101101011011011011101011100000101000100111101101100100101001100101011110 e5bcb5e28690ebbcbbe6b5b4eb9cb9eba480e78d92ebb782ed91b7e7b5b6ec9a90eb8880e58eadec9697eab5a1e58e93eab980eca785e7b5b6eb8289ed92995e
UHC 張←뼻浴뜹뤀獒뷂푷絶욐눀厭얗굡厓김짅絶낉풙^ 11101101111001011010000111100111100101101011111011101001101100011011011011100101100011111011000111101000101000111001010011101111101111101000010111101111101111101001111011101110100001111010000111100110111101001011111011101001101100011011011011100100111011011011000111101000101000111001010011101111101111101000010111101111101111101001110001011110 ede5a1e796bee9b1b6e58fb1e8a394efbe85efbe9eee87a1e6f4bee9b1b6e4edb1e8a394efbe85efbe9c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)