What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????\ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN ?????????佚??????????\ 00111111001111110011111100111111001111110011111100111111001111110011111110011000110000110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f98c33f3f3f3f3f3f3f3f3f3f5c
EUC-JP ?????????佚??????????\ 00111111001111110011111100111111001111110011111100111111001111110011111111010000110001010011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3fd0c53f3f3f3f3f3f3f3f3f3f5c
UTF-8 溜븍젿溜븍젡溜븐뀛佚뚮젿溜븍젡溜븐뀓溜븒\ 11101111101001111000101111101011101110001000110111101100101000001011111111101111101001111000101111101011101110001000110111101100101000001010000111101111101001111000101111101011101110001001000011101011100000001001101111100100101111011001101011101011100110101010111011101100101000001011111111101111101001111000101111101011101110001000110111101100101000001010000111101111101001111000101111101011101110001001000011101011100000001001001111101111101001111000101111101011101110001001001001011100 efa78bebb88deca0bfefa78bebb88deca0a1efa78bebb890eb809be4bd9aeb9aaeeca0bfefa78bebb88deca0a1efa78bebb890eb8093efa78bebb8925c
UHC 溜븍젿溜븍젡溜븐뀛佚뚮젿溜븍젡溜븐뀓溜븒\ 1110101011111110101110101110101110100000101100011110101011111110101110101110101110100000100110101110101011111110101110101110110010000101100101001110110011101010100011001110101110100000101100011110101011111110101110101110101110100000100110101110101011111110101110101110110010000101100011011110101011111110100101010111100101011100 eafebaeba0b1eafebaeba09aeafebaec8594ecea8ceba0b1eafebaeba09aeafebaec858deafe95795c

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
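
The conversion in the table above can be approximated in Python. This is a minimal sketch, not the converter behind this page: the codec names in CODECS (for example "cp932" for SJIS-WIN and "cp949" for UHC) and the helper names encode_row and encode_table are assumptions chosen for illustration, and Python's codecs may treat some edge cases differently. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the runs of 3f in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # One kanji followed by a backslash, as in the rows above.
    for label, (bits, hex_string) in encode_table("佚\\").items():
        print(label, bits, hex_string)
```

For the input 佚\ the UTF-8 row comes out as e4bd9a5c, while ISO-8859-1, which has no code for the kanji, yields 3f5c.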