To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN ?ョ???????}v?ョ???????}vB 00111111100000111000011100111111001111110011111100111111001111110011111100111111011111010111011000111111100000111000011100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f83873f3f3f3f3f3f3f7d763f83873f3f3f3f3f3f3f7d7642
EUC-JP 縯ョ?獒?????}v縯ョ?獒?????}vB 100011111101010011001011101001011110011100111111100011111100101110111011001111110011111100111111001111110011111101111101011101101000111111010100110010111010010111100111001111111000111111001011101110110011111100111111001111110011111100111111011111010111011001000010 8fd4cba5e73f8fcbbb3f3f3f3f3f7d768fd4cba5e73f8fcbbb3f3f3f3f3f7d7642
UTF-8 縯ョㅊ獒붻쾷了몌슐}v縯ョㅊ獒붻쾷了몌슐}vB 1110011110111000101011111110001110000011101001111110001110000101100010101110011110001101100100101110101110110110101110111110110010111110101101111110111110100110101110101110101110101010100011001110110010001010100100000111110101110110111001111011100010101111111000111000001110100111111000111000010110001010111001111000110110010010111010111011011010111011111011001011111010110111111011111010011010111010111010111010101010001100111011001000101010010000011111010111011001000010 e7b8afe383a7e3858ae78d92ebb6bbecbeb7efa6baebaa8cec8a907d76e7b8afe383a7e3858ae78d92ebb6bbecbeb7efa6baebaa8cec8a907d7642
UHC 縯ョㅊ獒붻쾷了몌슐}v縯ョㅊ獒붻쾷了몌슐}vB 1110011011100000101010111110011110100100101110101110100010100011100101001110100010110010100011011110100011100111101110001110111110111101101101100111110101110110111001101110000010101011111001111010010010111010111010001010001110010100111010001011001010001101111010001110011110111000111011111011110110110110011111010111011001000010 e6e0abe7a4bae8a394e8b28de8e7b8efbdb67d76e6e0abe7a4bae8a394e8b28de8e7b8efbdb67d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)