To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 庄??奉∝駿?臍?庄??奉∝駿?臍?^ 1000111110101111001111110011111110010101111100101000000111100101100011110111100000111111111001000110000000111111100011111010111100111111001111111001010111110010100000011110010110001111011110000011111111100100011000000011111101011110 8faf3f3f95f281e58f783fe4603f8faf3f3f95f281e58f783fe4603f5e
EUC-JP 庄??奉∝駿?臍?庄??奉∝駿?臍?^ 1011111010110001001111110011111111001010111101001010001011100111101111011101100100111111111001111100000100111111101111101011000100111111001111111100101011110100101000101110011110111101110110010011111111100111110000010011111101011110 beb13f3fcaf4a2e7bdd93fe7c13fbeb13f3fcaf4a2e7bdd93fe7c13f5e
UTF-8 庄얏렫奉∝駿렯臍뻠庄얏렫奉∝駿렯臍뻠^ 11100101101110101000010011101100100101101000111111101011101000001010101111100101101001011000100111100010100010001001110111101001101001111011111111101011101000001010111111101000100001111000110111101011101110111010000011100101101110101000010011101100100101101000111111101011101000001010101111100101101001011000100111100010100010001001110111101001101001111011111111101011101000001010111111101000100001111000110111101011101110111010000001011110 e5ba84ec968feba0abe5a589e2889de9a7bfeba0afe8878debbba0e5ba84ec968feba0abe5a589e2889de9a7bfeba0afe8878debbba05e
UHC 庄얏렫奉∝駿렯臍뻠庄얏렫奉∝駿렯臍뻠^ 11101101111001001011111011100110100011101011100111011100111001011010000111110000111100011110011110001110101111001111000010110000101110111011101011101101111001001011111011100110100011101011100111011100111001011010000111110000111100011110011110001110101111001111000010110000101110111011101001011110 ede4bee68eb9dce5a1f0f1e78ebcf0b0bbbaede4bee68eb9dce5a1f0f1e78ebcf0b0bbba5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)