To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 題???趙??終?U}題???趙??終?U{^ 1001000111101000001111110011111100111111111001101110001000111111001111111000111101001001001111110101010101111101100100011110100000111111001111110011111111100110111000100011111100111111100011110100100100111111010101010111101101011110 91e83f3f3fe6e23f3f8f493f557d91e83f3f3fe6e23f3f8f493f557b5e
EUC-JP 題???趙??終?U}題???趙??終?U{^ 1100001011101010001111110011111100111111111011001110010000111111001111111011110110101010001111110101010101111101110000101110101000111111001111110011111111101100111001000011111100111111101111011010101000111111010101010111101101011110 c2ea3f3f3fece43f3fbdaa3f557dc2ea3f3f3fece43f3fbdaa3f557b5e
UTF-8 題띳렰렪趙얗섦終렫U}題띳렰렪趙얗섦終렫U{^ 1110100110100001100011001110101110011101101100111110101110100000101100001110101110100000101010101110100010110110100110011110110010010110100101111110110010000100101001101110011110110101100000101110101110100000101010110101010101111101111010011010000110001100111010111001110110110011111010111010000010110000111010111010000010101010111010001011011010011001111011001001011010010111111011001000010010100110111001111011010110000010111010111010000010101011010101010111101101011110 e9a18ceb9db3eba0b0eba0aae8b699ec9697ec84a6e7b582eba0ab557de9a18ceb9db3eba0b0eba0aae8b699ec9697ec84a6e7b582eba0ab557b5e
UHC 題띳렰렪趙얗섦終렫U}題띳렰렪趙얗섦終렫U{^ 1111000010111001101101101111000110001110101111011000111010111000111100001110000110111110111010011011110010110100111100001111101110001110101110010101010101111101111100001011100110110110111100011000111010111101100011101011100011110000111000011011111011101001101111001011010011110000111110111000111010111001010101010111101101011110 f0b9b6f18ebd8eb8f0e1bee9bcb4f0fb8eb9557df0b9b6f18ebd8eb8f0e1bee9bcb4f0fb8eb9557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)