To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN ???簡???稈^ 0011111100111111001111111000101011001000001111110011111100111111111000100110001001011110 3f3f3f8ac83f3f3fe2625e
EUC-JP ??堉簡??堉稈^ 001111110011111110001111101101111111110110110100110010100011111100111111100011111011011111111101111000111100001101011110 3f3f8fb7fdb4ca3f3f8fb7fde3c35e
UTF-8 셈뤊堉簡셈뤊堉稈^ 11101100100001011000100011101011101001001000101011100101101000001000100111100111101100001010000111101100100001011000100011101011101001001000101011100101101000001000100111100111101010001000100001011110 ec8588eba48ae5a089e7b0a1ec8588eba48ae5a089e7a8885e
UHC 셈뤊堉簡셈뤊堉稈^ 1011110011000000100011111011101011101011101111001100101011011011101111001100000010001111101110101110101110111100110010101101100101011110 bcc08fbaebbccadbbcc08fbaebbccad95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)