To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN ?妻娼?妻創^ 0011111110001101110010001000111110101001001111111000110111001000100100010110111001011110 3f8dc88fa93f8dc8916e5e
EUC-JP ?妻娼?妻創^ 0011111110111010110010101011111010101011001111111011101011001010110000011100111101011110 3fbacabeab3fbacac1cf5e
UTF-8 렾妻娼렾妻創^ 11101011101000001011111011100101101001101011101111100101101010001011110011101011101000001011111011100101101001101011101111100101100010011011010101011110 eba0bee5a6bbe5a8bceba0bee5a6bbe589b55e
UHC 렾妻娼렾妻創^ 10001110110001101111010010100011111100111101111010001110110001101111010010100011111100111101110001011110 8ec6f4a3f3de8ec6f4a3f3dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)