To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 諺?????蹂⑤?}v諺?????蹂⑤?}vB 1000110010111111001111110011111100111111001111110011111111100110111110001000011101000100001111110111110101110110100011001011111100111111001111110011111100111111001111111110011011111000100001110100010000111111011111010111011001000010 8cbf3f3f3f3f3fe6f887443f7d768cbf3f3f3f3f3fe6f887443f7d7642
EUC-JP 諺??嫄??蹂??}v諺??嫄??蹂??}vB 10111000110000010011111100111111100011111011101010100001001111110011111111101100111110100011111100111111011111010111011010111000110000010011111100111111100011111011101010100001001111110011111111101100111110100011111100111111011111010111011001000010 b8c13f3f8fbaa13f3fecfa3f3f7d76b8c13f3f8fbaa13f3fecfa3f3f7d7642
UTF-8 諺뚣렗嫄쇔깷蹂⑤뮢}v諺뚣렗嫄쇔깷蹂⑤뮢}vB 1110100010101011101110101110101110011010101000111110101110100000100101111110010110101011100001001110110010000111100101001110101010111001101101111110100010111001100000101110001010010001101001001110101110101110101000100111110101110110111010001010101110111010111010111001101010100011111010111010000010010111111001011010101110000100111011001000011110010100111010101011100110110111111010001011100110000010111000101001000110100100111010111010111010100010011111010111011001000010 e8abbaeb9aa3eba097e5ab84ec8794eab9b7e8b982e291a4ebaea27d76e8abbaeb9aa3eba097e5ab84ec8794eab9b7e8b982e291a4ebaea27d7642
UHC 諺뚣렗嫄쇔깷蹂⑤뮢}v諺뚣렗嫄쇔깷蹂⑤뮢}vB 1110010111101100100011001110001110001110101011001110101010110001101111001110010110000011101001011110101110110011101010001110101110010010101011100111110101110110111001011110110010001100111000111000111010101100111010101011000110111100111001011000001110100101111010111011001110101000111010111001001010101110011111010111011001000010 e5ec8ce38eaceab1bce583a5ebb3a8eb92ae7d76e5ec8ce38eaceab1bce583a5ebb3a8eb92ae7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)