To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 巖??曖??曖??}v巖??曖??曖??}vB 1001101111011100001111110011111110011110010000100011111100111111100111100100001000111111001111110111110101110110100110111101110000111111001111111001111001000010001111110011111110011110010000100011111100111111011111010111011001000010 9bdc3f3f9e423f3f9e423f3f7d769bdc3f3f9e423f3f9e423f3f7d7642
EUC-JP 巖??曖??曖??}v巖??曖??曖??}vB 1101011011011110001111110011111111011011101000110011111100111111110110111010001100111111001111110111110101110110110101101101111000111111001111111101101110100011001111110011111111011011101000110011111100111111011111010111011001000010 d6de3f3fdba33f3fdba33f3f7d76d6de3f3fdba33f3fdba33f3f7d7642
UTF-8 巖꼷뢞曖럮뢈曖뜌뢌}v巖꼷뢞曖럮뢈曖뜌뢌}vB 1110010110110111100101101110101010111100101101111110101110100010100111101110011010011011100101101110101110011111101011101110101110100010100010001110011010011011100101101110101110011100100011001110101110100010100011000111110101110110111001011011011110010110111010101011110010110111111010111010001010011110111001101001101110010110111010111001111110101110111010111010001010001000111001101001101110010110111010111001110010001100111010111010001010001100011111010111011001000010 e5b796eabcb7eba29ee69b96eb9faeeba288e69b96eb9c8ceba28c7d76e5b796eabcb7eba29ee69b96eb9faeeba288e69b96eb9c8ceba28c7d7642
UHC 巖꼷뢞曖럮뢈曖뜌뢌}v巖꼷뢞曖럮뢈曖뜌뢌}vB 1110010011011100100001001000111110001111010110011110010011110010100011101000111110001111010001001110010011110010100011011000111110001111010010000111110101110110111001001101110010000100100011111000111101011001111001001111001010001110100011111000111101000100111001001111001010001101100011111000111101001000011111010111011001000010 e4dc848f8f59e4f28e8f8f44e4f28d8f8f487d76e4dc848f8f59e4f28e8f8f44e4f28d8f8f487d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)