What bit string is a character (or characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 馭??乙??魏??馭??馭??乙??魏??馭??B 111010010110011000111111001111111000100110110011001111110011111111101001101100000011111100111111111010010110011000111111001111111110100101100110001111110011111110001001101100110011111100111111111010011011000000111111001111111110100101100110001111110011111101000010 e9663f3f89b33f3fe9b03f3fe9663f3fe9663f3f89b33f3fe9b03f3fe9663f3f42
EUC-JP 馭??乙??魏??馭??馭??乙??魏??馭??B 111100011100011100111111001111111011001010110101001111110011111111110010101100100011111100111111111100011100011100111111001111111111000111000111001111110011111110110010101101010011111100111111111100101011001000111111001111111111000111000111001111110011111101000010 f1c73f3fb2b53f3ff2b23f3ff1c73f3ff1c73f3fb2b53f3ff2b23f3ff1c73f3f42
UTF-8 馭곥룊乙쀯㎖魏뉗뒏馭곸캊馭곥룊乙쀯㎖魏뉗뒏馭곸캊B 11101001101001101010110111101010101100111010010111101011101000111000101011100100101110011001100111101100100000001010111111100011100011101001011011101001101011011000111111101011100010011001011111101011100100101000111111101001101001101010110111101010101100111011100011101100101110101000101011101001101001101010110111101010101100111010010111101011101000111000101011100100101110011001100111101100100000001010111111100011100011101001011011101001101011011000111111101011100010011001011111101011100100101000111111101001101001101010110111101010101100111011100011101100101110101000101001000010 e9a6adeab3a5eba38ae4b999ec80afe38e96e9ad8feb8997eb928fe9a6adeab3b8ecba8ae9a6adeab3a5eba38ae4b999ec80afe38e96e9ad8feb8997eb928fe9a6adeab3b8ecba8a42
UHC 馭곥룊乙쀯㎖魏뉗뒏馭곸캊馭곥룊乙쀯㎖魏뉗뒏馭곸캊B 11100101110111111000000111100011100011111000100111101011111000001001011111101111101001111010001011101010111000001000011111101100100010101000110011100101110111111000000111101100101011111001010111100101110111111000000111100011100011111000100111101011111000001001011111101111101001111010001011101010111000001000011111101100100010101000110011100101110111111000000111101100101011111001010101000010 e5df81e38f89ebe097efa7a2eae087ec8a8ce5df81ecaf95e5df81e38f89ebe097efa7a2eae087ec8a8ce5df81ecaf9542

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
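
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name to_bit_strings and the CHARSETS mapping are hypothetical, and the codec names are assumptions about which Python codecs correspond to the labels above (cp932 for SJIS-WIN, cp949 for UHC). Characters that a charset cannot represent are replaced with "?" (hex 3f), which may differ in detail from how this page handles them.

```python
def to_bit_strings(text, charset):
    """Return (binary, hexadecimal) strings for `text` encoded in `charset`."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(charset, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# Page label -> Python codec name (assumed correspondences).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("B", codec)
        print(label, binary, hexadecimal)
```

For the ASCII letter B, every charset listed here yields 01000010 (hex 42), which matches the final byte of each row in the table above.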