What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??夕??????????????????? 0011111100111111100101110101101100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f975b3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ??夕??????????????????悰 00111111001111111100110110111100001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111100011111011110111111100 3f3fcdbc3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f8fbdfc
UTF-8 센샵夕센셸렯렳렯렞센샵렯렳렯렞센샵렯렳렯렞悰 111011001000010010111100111011001000001110110101111001011010010010010101111011001000010010111100111011001000010110111000111010111010000010101111111010111010000010110011111010111010000010101111111010111010000010011110111011001000010010111100111011001000001110110101111010111010000010101111111010111010000010110011111010111010000010101111111010111010000010011110111011001000010010111100111011001000001110110101111010111010000010101111111010111010000010110011111010111010000010101111111010111010000010011110111001101000001010110000 ec84bcec83b5e5a495ec84bcec85b8eba0afeba0b3eba0afeba09eec84bcec83b5eba0afeba0b3eba0afeba09eec84bcec83b5eba0afeba0b3eba0afeba09ee682b0
UHC 센샵夕센셸렯렳렯렞센샵렯렳렯렞센샵렯렳렯렞悰 1011110010111110101111001010010111100000101010101011110010111110101111001101000010001110101111001000111011000000100011101011110010001110101011111011110010111110101111001010010110001110101111001000111011000000100011101011110010001110101011111011110010111110101111001010010110001110101111001000111011000000100011101011110010001110101011111111000011110101 bcbebca5e0aabcbebcd08ebc8ec08ebc8eafbcbebca58ebc8ec08ebc8eafbcbebca58ebc8ec08ebc8eaff0f5

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
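
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumed Python equivalents of the charset labels, and the helper names encode_as_bits and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how they appear in the table rows.

```python
# Charset labels follow the table; the Python codec names on the right
# are assumed equivalents, not something this page documents.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_as_bits(text, codec):
    """Encode text with the given codec and return (binary string, hex string).

    Unencodable characters become '?' (byte 0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    # Each byte is rendered as exactly eight binary digits.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_as_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("夕").items():
        print(label, bits, hex_string)
```

Running the script prints one line per charset for the character 夕, in the same column order as the table (charset, binary, hexadecimal). Other input can be passed to convert() in the same way.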