What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???橈?????僥??瑤??嗚??絶?? 0011111100111111001111111001111011110100001111110011111100111111001111110011111110011001010001100011111100111111111010101010001000111111001111111001101001101010001111110011111110010000111000100011111100111111 3f3f3f9ef43f3f3f3f3f99463f3feaa23f3f9a6a3f3f90e23f3f
EUC-JP ???橈?????僥??瑤??嗚??絶?? 0011111100111111001111111101110011110110001111110011111100111111001111110011111111010001101001110011111100111111111101001010010000111111001111111101001111001011001111110011111111000000111001000011111100111111 3f3f3fdcf63f3f3f3f3fd1a73f3ff4a43f3fd3cb3f3fc0e43f3f
UTF-8 了묋뮅橈볩풄療뚩쾫僥뺧풐瑤뀐쉥嗚붼쐯絶뚪졇 111011111010011010111010111010111010110010001011111010111010111010000101111001101010100110001000111010111011001110101001111011011001001010000100111011111010011110000001111010111001101010101001111011001011111010101011111001011000001110100101111010111011101010100111111011011001001010010000111001111001000110100100111010111000000010010000111011001000100110100101111001011001011110011010111010111011011010111100111011001001000010101111111001111011010110110110111010111001101010101010111011001010000110000111 efa6baebac8bebae85e6a988ebb3a9ed9284efa781eb9aa9ecbeabe583a5ebbaa7ed9290e791a4eb8090ec89a5e5979aebb6bcec90afe7b5b6eb9aaaeca187
UHC 了묋뮅橈볩풄療뚩쾫僥뺧풐瑤뀐쉥嗚붼쐯絶뚪졇 111010001110011110010001111010001001001010010100111010001111101010010011111011111011111010001100111010001111111010001100111010001011001010000010111010001110100110010101111011111011111010010100111010001111110110110010111011111011110110101011111001111111000010010100111010011001110010010011111011111011111010001100111010011010000010111000 e8e791e89294e8fa93efbe8ce8fe8ce8b282e8e995efbe94e8fdb2efbdabe7f094e99c93efbe8ce9a0b8
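
The conversion shown in the table above can be approximated offline. The following is a minimal sketch in Python, not the page's actual implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `convert` are assumptions chosen for illustration. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to match the behavior seen in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one line per charset for a sample character.
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

Note that the input must be correctly decoded before encoding. If the text is first decoded with the wrong charset, each row shows the bytes of the resulting mojibake rather than of the intended characters.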

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).