To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????C 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000011 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f43
SJIS-WIN ?????????????????????曖??C 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111001111001000010001111110011111101000011 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f9e423f3f43
EUC-JP ?????????????????????曖??C 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101101110100011001111110011111101000011 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fdba33f3f43
UTF-8 溜삳젵溜븍젛溜삳젉溜븍젲溜삳젲溜븍젾溜삥컩曖쒕젽C 11101111101001111000101111101100100000101011001111101100101000001011010111101111101001111000101111101011101110001000110111101100101000001001101111101111101001111000101111101100100000101011001111101100101000001000100111101111101001111000101111101011101110001000110111101100101000001011001011101111101001111000101111101100100000101011001111101100101000001011001011101111101001111000101111101011101110001000110111101100101000001011111011101111101001111000101111101100100000101010010111101100101110111010100111100110100110111001011011101100100100101001010111101100101000001011110101000011 efa78bec82b3eca0b5efa78bebb88deca09befa78bec82b3eca089efa78bebb88deca0b2efa78bec82b3eca0b2efa78bebb88deca0beefa78bec82a5ecbba9e69b96ec9295eca0bd43
UHC 溜삳젵溜븍젛溜삳젉溜븍젲溜삳젲溜븍젾溜삥컩曖쒕젽C 11101010111111101011101111101011101000001010100111101010111111101011101011101011101000001001011111101010111111101011101111101011101000001000101111101010111111101011101011101011101000001010011011101010111111101011101111101011101000001010011011101010111111101011101011101011101000001011000011101010111111101011101111100110101100001001000111100100111100101001110011101011101000001010111101000011 eafebbeba0a9eafebaeba097eafebbeba08beafebaeba0a6eafebbeba0a6eafebaeba0b0eafebbe6b091e4f29ceba0af43

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)