What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 町孟??窪?睛??鞨??町孟??窪?睛??鞨??^ 1001001010101100100101101101000000111111001111111000110001000101001111111110000111001011001111110011111111101000111000000011111100111111100100101010110010010110110100000011111100111111100011000100010100111111111000011100101100111111001111111110100011100000001111110011111101011110 92ac96d03f3f8c453fe1cb3f3fe8e03f3f92ac96d03f3f8c453fe1cb3f3fe8e03f3f5e
EUC-JP 町孟??窪?睛??鞨??町孟??窪?睛??鞨??^ 1100010010101110110011001101001000111111001111111011011110100110001111111110001011001101001111110011111111110000111000100011111100111111110001001010111011001100110100100011111100111111101101111010011000111111111000101100110100111111001111111111000011100010001111110011111101011110 c4aeccd23f3fb7a63fe2cd3f3ff0e23f3fc4aeccd23f3fb7a63fe2cd3f3ff0e23f3f5e
UTF-8 町孟렟렫窪렜睛뀀렒鞨렯렞町孟렟렫窪렜睛뀀렒鞨렯렞^ 11100111100101001011101011100101101011011001111111101011101000001001111111101011101000001010101111100111101010101010101011101011101000001001110011100111100111011001101111101011100000001000000011101011101000001001001011101001100111101010100011101011101000001010111111101011101000001001111011100111100101001011101011100101101011011001111111101011101000001001111111101011101000001010101111100111101010101010101011101011101000001001110011100111100111011001101111101011100000001000000011101011101000001001001011101001100111101010100011101011101000001010111111101011101000001001111001011110 e794bae5ad9feba09feba0abe7aaaaeba09ce79d9beb8080eba092e99ea8eba0afeba09ee794bae5ad9feba09feba0abe7aaaaeba09ce79d9beb8080eba092e99ea8eba0afeba09e5e
UHC 町孟렟렫窪렜睛뀀렒鞨렯렞町孟렟렫窪렜睛뀀렒鞨렯렞^ 11101111111010111101100011101011100011101011000010001110101110011110100011000001100011101010111011101111111011001011001011101011100011101010011111001010111010101000111010111100100011101010111111101111111010111101100011101011100011101011000010001110101110011110100011000001100011101010111011101111111011001011001011101011100011101010011111001010111010101000111010111100100011101010111101011110 efebd8eb8eb08eb9e8c18eaeefecb2eb8ea7caea8ebc8eafefebd8eb8eb08eb9e8c18eaeefecb2eb8ea7caea8ebc8eaf5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
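
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation. The helper name `encode_table` and the codec mapping are assumptions: `cp932` stands in for SJIS-Win (as noted above), and `cp949` is assumed to correspond to UHC. Using `errors="replace"` is also an assumption; it substitutes `?` (hex 3f) for characters a charset cannot represent, which appears to match the 3f bytes in the table.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # Unencodable characters become "?" (0x3f) instead of raising an error.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("町^"):
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.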