To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 贈???錠???頂 100100011010000100111111001111110011111110001111111110010011111100111111001111111001001010111000 91a13f3f3f8ff93f3f3f92b8
EUC-JP 贈???錠???頂 110000101010001100111111001111110011111110111110111110110011111100111111001111111100010010111010 c2a33f3f3fbefb3f3f3fc4ba
UTF-8 贈찔렰렞錠찔렰렟頂 111010001011010010001000111011001011000010010100111010111010000010110000111010111010000010011110111010011000110010100000111011001011000010010100111010111010000010110000111010111010000010011111111010011010000010000010 e8b488ecb094eba0b0eba09ee98ca0ecb094eba0b0eba09fe9a082
UHC 贈찔렰렞錠찔렰렟頂 111100011111110011000010111100011000111010111101100011101010111111101111111111001100001011110001100011101011110110001110101100001111000010100010 f1fcc2f18ebd8eafeffcc2f18ebd8eb0f0a2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)