To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??魏??循??娃?0認ラ?蟻??億??筍 1110100111110010001111110011111111101001101100000011111100111111100011110111101000111111001111111000100010100001001111111000001001001111100101000100011010000011100010010011111110001011011000010011111100111111100010011010110100111111001111111110001010100001 e9f23f3fe9b03f3f8f7a3f3f88a13f824f944683893f8b613f3f89ad3f3fe2a1
EUC-JP 鶯??魏??循??娃?0認ラ?蟻??億??筍 1111001011110100001111110011111111110010101100100011111100111111101111011101101100111111001111111011000010100011001111111010001110110000110001111010011110100101111010010011111110110101110000100011111100111111101100101010111100111111001111111110010010100011 f2f43f3ff2b23f3fbddb3f3fb0a33fa3b0c7a7a5e93fb5c23f3fb2af3f3fe4a3
UTF-8 鶯볦눖魏꾬㎟循녿짎娃븍0認ラ쉬蟻숇쾴億됰끝筍 111010011011011010101111111010111011001110100110111010111000100010010110111010011010110110001111111010101011111010101100111000111000111010011111111001011011111010101010111010111000010110111111111011001010011110001110111001011010100010000011111010111011100010001101111011111011110010010000111010001010101010001101111000111000001110101001111011001000100110101100111010001001111110111011111011001000100010000111111011001011111010110100111001011000010010000100111010111001000010110000111010111000000110011101111001111010110110001101 e9b6afebb3a6eb8896e9ad8feabeace38e9fe5beaaeb85bfeca78ee5a883ebb88defbc90e8aa8de383a9ec89ace89fbbec8887ecbeb4e58484eb90b0eb819de7ad8d
UHC 鶯볦눖魏꾬㎟循녿짎娃븍0認ラ쉬蟻숇쾴億됰끝筍 1110010110100011100100111110110010000111101100001110101011100000100001001110111110100111101100011110001011100000100001101110101110100011100110101110100011011111101110101110101110100011101100001110110011100011101010111110100110111101101011001110101111111100100110011110101110110010100010101110010111100010100010011110101110110011101000011110001011101100 e5a393ec87b0eae084efa7b1e2e086eba39ae8dfbaeba3b0ece3abe9bdacebfc99ebb28ae5e289ebb3a1e2ec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)