What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 誤??修?????[誤??修?????[^ 10001100111010110011111100111111100011110100001100111111001111110011111100111111001111110101101110001100111010110011111100111111100011110100001100111111001111110011111100111111001111110101101101011110 8ceb3f3f8f433f3f3f3f3f5b8ceb3f3f8f433f3f3f3f3f5b5e
EUC-JP 誤??修?????[誤??修?????[^ 10111000111011010011111100111111101111011010010000111111001111110011111100111111001111110101101110111000111011010011111100111111101111011010010000111111001111110011111100111111001111110101101101011110 b8ed3f3fbda43f3f3f3f3f5bb8ed3f3fbda43f3f3f3f3f5b5e
UTF-8 誤볠쫲修뗨쫼礖듼븢[誤볠쫲修뗨쫼礖듼븢[^ 111010001010101010100100111010111011001110100000111011001010101110110010111001001011111110101110111010111001011110101000111011001010101110111100111001111010010010010110111010111001001110111100111010111011100010100010010110111110100010101010101001001110101110110011101000001110110010101011101100101110010010111111101011101110101110010111101010001110110010101011101111001110011110100100100101101110101110010011101111001110101110111000101000100101101101011110 e8aaa4ebb3a0ecabb2e4bfaeeb97a8ecabbce7a496eb93bcebb8a25be8aaa4ebb3a0ecabb2e4bfaeeb97a8ecabbce7a496eb93bcebb8a25b5e
UHC 誤볠쫲修뗨쫼礖듼븢[誤볠쫲修뗨쫼礖듼븢[^ 111010001010011010010011111001101010011010001010111000011111001110001011111010001010011010010011111001101010011010001010111000101001010110001011010110111110100010100110100100111110011010100110100010101110000111110011100010111110100010100110100100111110011010100110100010101110001010010101100010110101101101011110 e8a693e6a68ae1f38be8a693e6a68ae2958b5be8a693e6a68ae1f38be8a693e6a68ae2958b5b5e

SJIS-win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
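
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-win, `cp949` for UHC) are assumed to correspond to the charsets listed. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is consistent with the `3f` runs in the ISO-8859-1 row.

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",   # assumption: CP932 is the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is the closest Python codec
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "誤"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Using `errors="replace"` is a design choice for this sketch; pass `errors="strict"` instead if an exception is preferable to silent substitution.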