To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????M 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4d
SJIS-WIN ?????????????????????M 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4d
EUC-JP ?????????????????????M 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4d
UTF-8 溜삳젗溜븅갱嶪쎈젍溜븍젶溜뷸솹溜볥졋燎덈젍M 11101111101001111000101111101100100000101011001111101100101000001001011111101111101001111000101111101011101110001000010111101010101100001011000111100101101101101010101011101100100011101000100011101100101000001000110111101111101001111000101111101011101110001000110111101100101000001011011011101111101001111000101111101011101101111011100011101100100001101011100111101111101001111000101111101011101100111010010111101100101000011000101111101111101001111000000011101011100011011000100011101100101000001000110101001101 efa78bec82b3eca097efa78bebb885eab0b1e5b6aaec8e88eca08defa78bebb88deca0b6efa78bebb7b8ec86b9efa78bebb3a5eca18befa780eb8d88eca08d4d
UHC 溜삳젗溜븅갱嶪쎈젍溜븍젶溜뷸솹溜볥졋燎덈젍M 11101010111111101011101111101011101000001001001111101010111111101011101011101001101100001011101111100101111101011011110111101011101000001000111011101010111111101011101011101011101000001010101011101010111111101011101011100110100110011010111011101010111111101001001111101011101000001011101011101000111110111000100011101011101000001000111001001101 eafebbeba093eafebae9b0bbe5f5bdeba08eeafebaeba0aaeafebae699aeeafe93eba0bae8fb88eba08e4d

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
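
The conversion in the table can be approximated offline. The following is a minimal sketch in Python, not the converter's actual implementation. The function names (encode_row, encode_table) and the mapping from the table's charset labels to Python codec names are assumptions made for illustration; in particular, UHC is assumed to correspond to Python's cp949 codec, and Python's euc_jp codec may differ from the converter's EUC-JP in some vendor-specific characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the ISO-8859-1, SJIS-WIN and EUC-JP rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the note above equates SJIS-Win with CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as Windows code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    sample = "M"
    for label, (binary, hexadecimal) in encode_table(sample).items():
        print(label, sample, binary, hexadecimal)
```

For the single letter "M", every charset in this sketch yields the same byte, 4d (binary 01001101), which matches the final byte of each row in the table.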