To what bit string is a character (or a string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????巽 0011111100111111001111110011111100111111001111110011111100111111001111111001001001000110 3f3f3f3f3f3f3f3f3f9246
EUC-JP ?????????巽 0011111100111111001111110011111100111111001111110011111100111111001111111100001110100111 3f3f3f3f3f3f3f3f3fc3a7
UTF-8 溜븍젿溜븍젚溜븐뀛巽 111011111010011110001011111010111011100010001101111011001010000010111111111011111010011110001011111010111011100010001101111011001010000010011010111011111010011110001011111010111011100010010000111010111000000010011011111001011011011110111101 efa78bebb88deca0bfefa78bebb88deca09aefa78bebb890eb809be5b7bd
UHC 溜븍젿溜븍젚溜븐뀛巽 1110101011111110101110101110101110100000101100011110101011111110101110101110101110100000100101101110101011111110101110101110110010000101100101001110000111011110 eafebaeba0b1eafebaeba096eafebaec8594e1de

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
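
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation. The codec names below (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumptions about which Python codecs correspond to the charsets listed, and the helper names encode_row and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed characters, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot represent.
    raw = text.encode(codec, errors="replace")
    shown = raw.decode(codec)
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Encode text in every charset and return one row per charset label."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


# Print only the charset label and hex column for a single-character input.
for name, (_shown, _bits, hex_string) in convert("巽").items():
    print(name, hex_string)
```

For the single character 巽, the UTF-8 row should come out as e5b7bd, SJIS-WIN as 9246 and EUC-JP as c3a7, matching the trailing bytes of the corresponding rows in the table.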