What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??荊?????必儡?咐げ??脹???B 0011111100111111100011000111010000111111001111110011111100111111001111111001010101001011100110010101001100111111100110011111001110000010101100000011111100111111100100101010111100111111001111110011111101000010 3f3f8c743f3f3f3f3f954b99533f99f382b03f3f92af3f3f3f42
EUC-JP ?堞荊?????必儡?咐げ??脹???B 00111111100011111011100010100100101101111101010100111111001111110011111100111111001111111100100110101100110100011011010000111111110100101111010110100100101100100011111100111111110001001011000100111111001111110011111101000010 3f8fb8a4b7d53f3f3f3f3fc9acd1b43fd2f5a4b23f3fc4b13f3f3f42
UTF-8 뤋堞荊쾸쵍샅렟뤋必儡샅咐げ렗뤋脹컦샘그B 11101011101001001000101111100101101000001001111011101000100011011000101011101100101111101011100011101100101101011000110111101100100000111000010111101011101000001001111111101011101001001000101111100101101111111000010111100101100001001010000111101100100000111000010111100101100100101001000011100011100000011001001011101011101000001001011111101011101001001000101111101000100001001011100111101100101110111010011011101100100000111001100011101010101101111011100001000010 eba48be5a09ee88d8aecbeb8ecb58dec8385eba09feba48be5bf85e584a1ec8385e59290e38192eba097eba48be884b9ecbba6ec8398eab7b842
UHC 뤋堞荊쾸쵍샅렟뤋必儡샅咐げ렗뤋脹컦샘그B 100011111011101111110100110111001111101110101010101100101000111010101100100011111011101111110100100011101011000010001111101110111111100110110001110101101110110110111011111101001101110011111011101010101011001010001110101011001000111110111011111100111110110010110000100011111011101111111001101100011101011101000010 8fbbf4dcfbaab28eac8fbbf48eb08fbbf9b1d6edbbf4dcfbaab28eac8fbbf3ecb08fbbf9b1d742

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
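
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The codec names are assumptions about how the table's labels map to Python's standard codecs ("SJIS-WIN" to `cp932`, "UHC" to `cp949`), and `errors="replace"` is assumed in order to mimic the `?` (hex `3f`) that appears where a charset cannot represent a character. The helper name `encode_row` is hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent are replaced with "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per encoded byte
    return raw.decode(codec), bits, raw.hex()


if __name__ == "__main__":
    sample = "必B"  # two characters taken from the table above
    for label, codec in CHARSETS.items():
        _shown, bits, hexstr = encode_row(sample, codec)
        print(label, bits, hexstr)
```

The first element of the returned tuple corresponds to the "Character" column: it is the text as it reads after a round trip through the charset, so unsupported characters come back as `?`.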